(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property 
Organization 

International Bureau 

(43) International Publication Date 
6 January 2005 (06.01.2005) 




(10) Internationa] Publication Number 

PCT WO 2005/001103 A2 



(51) International Patent Classification 7 : C12N 15/86, 
5/10, 7/01, C07K 14/075, A61K 39/235, 48/00, C12N 
15/34 

(21) International Application Number: 

PCT/US2004/01 66 14 



WILSON, James, M. [US/US]; 1350 N. Avignon Drive, 
Gladwyne, PA 19035 (US). 

(74) Agents: KODROFF, Cathy, A. et aL; Howson and How- 
son, Spring House Corporate Center, P.O. Box 457, Spring 
House, PA 19477 (US). 



(22) 


Internationa] Filing Date: 


15 June 2004 (15.06.2004) 


(25) 


Filing Language: 


English 


(26) 


Publication Language: 


English 


(30) 


Priority Data: 





10/465,302 
60/566,212 
60/575,429 



20 June 2003 (20.06.2003) US 
28 April 2004 (28.04.2004) US 
28 May 2004 (28.05.2004) US 



(63) Related by continuation (CON) or continuation-in-part 
(CIP) to earlier application: 

US 10/465,302 (CIP) 

Filed on 20 June 2003 (20.06.2003) 

(71) Applicant (for all designated States except US): THE 
TRUSTEES OF THE UNIVERSITY OF PENN- 
SYLVANIA [US/US]; 3160 Chestnut Street, Suite 200, 
Philadelphia, PA 19104 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): ROY, Soumitra 

[US/US]; 240 Pugh Road, Wayne, PA 19087 (US). 



(81) Designated States (unless otherwise indicated, for every 
kind of national protection available): AE, AG, AL, AM, 
AT, AU, AZ, BA, BB, BG, BR, BW, BY, BZ, CA, CH, CN, 
CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, EG, ES, FI, 
GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, 
KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, MD, 
MG, MX, MN, MW, MX, MZ, NA, NI, NO, NZ, OM, PG, 
PH, PL, PT, RO, RU, SC, SD, SE, SG, SK, SL, S Y, TJ, TM, 
TN, TR, TT, TZ, UA, UG, UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (unless otherwise indicated, for every 
kind of regional protection available): ARIPO (BW, GH, 
GM, KE, LS, MW, MZ, NA, SD, SL, SZ, TZ, UG, ZM, 
ZW), Eurasian (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European (AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, 
FR, GB, GR, HU, IE, IT, LU, MC, NL, PL, PT, RO, SE, SI, 
SK, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, 
GW, ML, MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations'* appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 

m 
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(57) Abstract: A method for providing an adenovirus from a serotype which does not grow efficiently in a desired cell line with 
O the ability to grow in that cell line is described. The method involves replacing the left and right termini of the adenovirus with the 
© corresponding termini from an adenovirus which grow efficiently in the desired cell line. At a minimum, the left terminus spans 
^ the (5*) inverted terminal repeat, the left terminus spans the E4 region and the (3^ inverted terminal repeat. The resulting chimeric 
Q adenovirus contains the internal regions spanning the genes encoding the penton, hexon and fiber from the serotype which does not 
^ grow efficiently in the desired cell. Also provided are vectors constructed from novel simian adenovirus sequences and proteins, 

host cells containing same, and uses thereof. 
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METHODS OF GENERATING CHIMERIC ADENOVIRUSES 
AND USES FOR SUCH CHIMERIC ADENOVIRUSES 

BACKGROUND OF THE INVENTION 

5 The presence of humoral immunity (circulating antibodies) to adenovirus 

capsid proteins is a barrier to the use of adenovirus vectors for gene therapy. The 
prototype adenovirus vectors that have been developed for gene therapy are based on 
subgroup C adenoviruses such as that of serotype 5. The prevalence of neutralizing 
antibodies against subgroup C adenoviruses is generally high in human populations as 

10 a result of frequent exposure to these pathogens. This fact is likely to greatly limit the 
effectiveness of gene therapy vectors based on serotypes such as Ad5. 

Analysis of the nature of the protective antibodies against adenoviruses has 
indicated that the most important target is the major capsid protein, hexon [Wolfhart 
(1988) J. Virol 62, 2321; Gall et al (1996) 1 Virol 70, 21 16]. Several efforts have 

1 5 been made to engineer the hexon so as to evade the anti-hexon antibodies by making 
chimeric adenoviruses harboring hexons from other serotypes [Roy et al (1998) J. 
Virol. 72, 6875; US Patent No 5,922,315; Gall et al (1998) J. Virol 72, 10260; Youil 
et al (2002) Hum. Gene Ther. 13, 311; Wu et al (2002) J. Virol 76, 12775]. 
However, this has been largely unsuccessful when exchanges among distant serotypes 

20 are attempted. 

Alternatively, investigators have proposed using adenovirus vectors that 
rarely cause human infections or using adenoviruses from non-human sources. 
However, the lack of a practical manner in which to produce large numbers of such 
vectors has proved to be a hindrance to developing such vectors. 

25 



SUMMARY OF THE INVENTION 

The present invention provides a method of modifying adenoviruses having 
capsids, and particularly, including hexons, from serotypes which are not well 
30 adapted for growth in cells useful for adenoviral virion production. The method is 
useful for production of scalable amounts of adenoviruses. The resulting chimeric 
adenovirus genomes are useful for a variety of purposes which are described herein. 
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The invention further provides novel, isolated, adenovirus SA18 nucleic acid 
and amino acid sequences, vectors containing same, cell lines containing such SA18 
sequences and/or vectors, and uses thereof. 

Other aspects and advantages of the present invention will be readily apparent 
5 from the following detailed description of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 provides the map of the genome of the simian adenovirus generated 
by shotgun cloning as described in the examples below. 
10 Fig. 2 provides the map of the recombinant Adhu5-SV25 chimeric virus, 

termed H5S25H5. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides chimeric adenovirus genomes composed of 
1 5 the left terminal end and right terminal end of an adenovirus which can be cultured in 
the selected host cell, and the internal regions encoding, at a minimum, the capsid 
proteins of another adenovirus serotype. This invention is particularly advantageous 
for generating adenoviruses having serotypes which are difficult to culture in a 
desired cell type. The invention thus permits generation of chimeric adenoviruses 
20 vectors of varying serotypes. 

In the embodiments illustrated herein, chimeric adenoviruses have been 
constructed where most structural proteins, and not merely the hexon or fiber, are 
derived from an adenovirus of an unrelated serotype, thereby preserving the majority 
of the protein-protein interactions that are involved in capsid assembly. Most of the 
25 early genes such as those encoded by the adenovirus El and E4 regions that are 
responsible for transcription regulation and regulation of the host cell cycle, are 
retained from a different serotype that is known to result in high titer virus generation 
in the commonly used cell types, such as HEK 293 which supplies the Ad5 El 
proteins in trans. 

30 In another embodiment, the invention provides novel nucleic acid and amino 

acid sequences from Ad SA18, which was originally isolated from vervet monkey 
[ATCC VR-943]. The present invention further provides novel adenovirus vectors 
and packaging cell lines to produce those vectors for use in the in vitro production of 
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recombinant proteins or fragments or other reagents. The invention further provides 
compositions for use in delivering a heterologous molecule for therapeutic or vaccine 
purposes. Such therapeutic or vaccine compositions contain the adenoviral vectors 
carrying an inserted heterologous molecule. In addition, novel sequences of the 
5 invention are useful in providing the essential helper functions required for 

production of recombinant adeno-associated viral (AAV) vectors. Thus, the invention 
provides helper constructs, methods and cell lines which use these sequences in such 
production methods. 

The term "substantial homology" or "substantial similarity," when referring to 

1 0 a nucleic acid or fragment thereof, indicates that, when optimally aligned with 
appropriate nucleotide insertions or deletions with another nucleic acid (or its 
complementary strand), there is nucleotide sequence identity in at least about 95 to 
99% of the aligned sequences. 

The term "substantial homology" or "substantial similarity," when referring to 

1 5 amino acids or fragments thereof, indicates that, when optimally aligned with 
appropriate amino acid insertions or deletions with another amino acid (or its 
complementary strand), there is amino acid sequence identity in at least about 95 to 
99% of the aligned sequences. Preferably, the homology is over full-length sequence, 
or a protein thereof, or a fragment thereof which is at least 8 amino acids, or more 

20 desirably, at least 15 amino acids in length. Examples of suitable fragments are 
described herein. 

The term "percent sequence identity" or "identical" in the context of nucleic 
acid sequences refers to the residues in the two sequences that are the same when 
aligned for maximum correspondence. The length of sequence identity comparison 

25 may be over the full-length of the genome (e.g., about 36 kbp), the full-length of an 
open reading frame of a gene, protein, subunit, or enzyme [see, e.g., the tables 
providing the adenoviral coding regions], or a fragment of at least about 500 to 5000 
nucleotides, is desired. However, identity among smaller fragments, e.g., of at least 
about nine nucleotides, usually at least about 20 to 24 nucleotides, at least about 28 to 

30 32 nucleotides, at least about 36 or more nucleotides, may also be desired. Similarly, 
"percent sequence identity" may be readily determined for amino acid sequences, 
over the full-length of a protein, or a fragment thereof. Suitably, a fragment is at least 
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about 8 amino acids in length, and may be up to about 700 amino acids. Examples of 
suitable fragments are described herein. 

Identity is readily determined using such algorithms and computer programs 
as are defined herein at default settings. Preferably, such identity is over the fiill 
5 length of the protein, enzyme, subunit, or over a fragment of at least about 8 amino 
acids in length. However, identity may be based upon shorter regions, where suited to 
the use to which the identical gene product is being put. 

As described herein, alignments are performed using any of a variety of 
publicly or commercially available Multiple Sequence Alignment Programs, such as 

10 "Clustal W", accessible through Web Servers on the internet. Alternatively, Vector 
NTI utilities are also used. There are also a number of algorithms known in the art 
that can be used to measure nucleotide sequence identity, including those contained in 
the programs described above. As another example, polynucleotide sequences can be 
compared using Fasta, a program in GCG Version 6.1. Fasta provides alignments and 

15 percent sequence identity of the regions of the best overlap between the query and 
search sequences. For instance, percent sequence identity between nucleic acid 
sequences can be determined using Fasta with its default parameters (a word size of 6 
and the NOPAM factor for the scoring matrix) as provided in GCG Version 6.1, 
herein incorporated by reference. Similarly programs are available for performing 

20 amino acid alignments. Generally, these programs are used at default settings, 

although one of skill in the art can alter these settings as needed. Alternatively, one of 
skill in the art can utilize another algorithm or computer program that provides at least 
the level of identity or alignment as that provided by the referenced algorithms and 
programs. 

25 As used throughout this specification and the claims, the term "comprise" and 

its variants including, "comprises", "comprising", among other variants, is inclusive 
of other components, elements, integers, steps and the like. The term "consists of 5 or 
"consisting of are exclusive of other components, elements, integers, steps and the 
like. 

30 Except where otherwise specified, the term 'Vector" includes any genetic 

element known in the art which will deliver a target molecule to a cell, including, 
naked DNA, a plasmid, phage, transposon, cosmids, episomes, viruses, etc. 
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By "minigene" is meant the combination of a selected heterologous gene and 
the other regulatory elements necessary to drive translation, transcription and/or 
expression of the gene product in a host cell. 

As used herein, the term "transcomplement" refers to when a gene (gene 
5 product) of one adenovirus serotype supplies an adenovirus serotype lacking this gene 
(gene product) from another serotype with the missing function. For example, human 
adenovirus serotype 5 El a and Elb functions are known to transcomplement El- 
deleted chimpanzee adenovirus Pan 9. Similarly, the inventors have found that 
human Ad5 El transcomplements El-deleted chimpanzee adenovirus serotypes Pan5, 
10 Pan6, Pan7, and simian adenovirus serotypes SV1, SV25 and SV39. Other examples 
of transcomplementing serotypes include human Ad5 and human Ad2, Ad3, Ad4, 
Ad5,Ad7, and Adl2. 

The term "functionally deleted" or "functional deletion" means that a 
sufficient amount of the gene region is removed or otherwise damaged, e.g., by 
1 5 mutation or modification, so that the gene region is no longer capable of producing 
functional products of gene expression. If desired, the entire gene region may be 
removed. Other suitable sites for gene disruption or deletion are discussed elsewhere 
in the application. 

The term "functional" refers to a product (e.g., a protein or peptide) which 
20 performs its native function, although not necessarily at the same level as the native 
product. The term "functional" may also refer to a gene which encodes and from 
which a desired product can be expressed. 

I. Chimeric Adenoviral Vectors 

25 The compositions of this invention include chimeric adenoviral vectors that 

deliver a heterologous molecule to cells. For delivery of such a heterologous 
molecule, the vector can be a plasmid or, preferably, a chimeric adenovirus. The 
chimeric adenoviruses of the invention include adenovirus DNA from at least two 
source serotypes, a "donating serotype" and a "parental adenovirus" as described in 

30 more detail herein, and a minigene. 

Because the adenoviral genome contains open reading frames on both strands, 
in many instances reference is made herein to 5' and 3' ends of the various regions to 
avoid confusion between specific open reading frames and gene regions. Thus, when 
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reference is made herein to the "left" and "right" end of the adenoviral genome, this 
reference is to the ends of the approximately 36 kb adenoviral genome when depicted 
in schematic form as is conventional in the art [see, e.g., Horwitz, "Adenoviridae and 
Their Replication", in VIROLOGY, 2d ed., pp. 1679-1721 (1990)]. Thus, as used 
5 herein, the "left terminal end" of the adenoviral genome refers to portion of the 

adenoviral genome which, when the genome is depicted schematically in linear form, 
is located at the extreme left end of the schematic. Typically, the left end refers to be 
portion of the genome beginning at map unit 0 and extending to the right to include at 
least the 5' inverted terminal repeats (ITRs), and excludes the internal regions of the 
10 genome encoding the structural genes. As used herein, the "right terminal end" of the 
adenoviral genome refers to portion of the adenoviral genome which, when the 
genome is depicted schematically in linear form, is located at the extreme right end of 
the schematic. Typically, the right end of the adenoviral genome refers to be portion 
of the genome ending at map unit 36 and extending to the left to include at least the 3' 
1 5 ITRs, and excludes the internal regions of the genome encoding the structural genes. 
A. Adenovirus Regulatory Sequences 
1 . Serotype 

The selection of the adenovirus serotype donating its left 
terminal end and right terminal end can be readily made by one of skill in the art from 
20 among serotypes which can readily be cultured in the desired cell line. Among other 
factors which may be considered in selecting the serotype of the donating serotype is 
compatibility with the adenovirus serotype which will be supplying the internal 
regions at the location at which their sequences are hybridized. 

Suitable adenoviruses for donating their left and right termini 
25 are available from the American Type Culture Collection, Manassas, Virginia, US 
(ATCC), a variety of academic and commercial sources, or the desired regions of the 
donating adenoviruses may be synthesized using known techniques with reference to 
sequences published in the literature or available from databases (e.g., GenBank, 
etc.). Examples of suitable donating adenoviruses include, without limitation, human 
30 adenovirus serotypes 2, 3, 4, 5, 7, and 12, and further including any of the presently 
identified human types [see, e.g., Horwitz, "Adenoviridae and Their Replication", in 
Virology, 2d ed., pp. 1679-1721 (1990)] which can be cultured in the desired cell. 
Similarly adenoviruses known to infect non-human primates (e.g., chimpanzees, 
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rhesus, macaque, and other simian species) or other non-human mammals and which 
grow in the desired cell can be employed in the vector constructs of this invention. 
Such serotypes include, without limitation, chimpanzee adenoviruses Pan 5 [VR-591], 
Pan6 [VR-592], Pan7 [VR-593], and C68 (Pan9), described in US Patent No. 
5 6,083,716; and simian adenoviruses including, without limitation SV1 [VR-195]; 
SV25 [SV-201]; SV35; SV15; SV-34; SV-36; SV-37, and baboon adenovirus [VR- 
275], among others. The sequences of Pan 5 (also termed C5), Pan 6 (also termed 
C6), Pan 7 (also termed C7), SV1, SV25, and SV39 have been described [WO 
03/046124, published 5 June 2003; and in US Patent Application No. 10/739,096, 

10 filed December 19, 2003)], which are incorporated by reference. In the following 

examples, the human 293 cells and adenovirus type 5 (Ad5), Pan9, and Ad40 are used 
for convenience. However, one of skill in the art will understand that other cell lines 
and/or comparable regions derived from other adenoviral strains may be readily 
selected and used in the present invention in the place of (or in combination with) 

15 these serotypes. 

2. Sequences 

The minimum sequences which must be supplied by the 
adenovirus donating its left terminal end and its right terminal end include the 5' cis- 
elements and the 3' cis-elements necessary for replication and packaging. Typically, 

20 the 5' cis-elements necessary for packaging and replication include the 5 5 inverted 
terminal repeat (ITR) sequences (which functions as origins of replication) and the 
native 5' packaging enhancer domains (that contain sequences necessary for 
packaging linear Ad genomes and enhancer elements for the El promoter). The right 
end of the adenoviral genome includes the 3' cis-elements (including the ITRs) 

25 necessary for packaging and encapsidation. Desirably, the adenovirus serotype 
donating its left and right termini and/or an adenovirus serotype which 
transcomplements the serotype of the donating adenovirus, further provides the 
functions of the necessary adenovirus early genes, including El (Ela and Elb), E2 
(E2a and E2b), and E4 (including at least the ORF6 region). E3 is not essential and 

30 may be deleted as desired, e.g., for insertion of a transgene in this region or to provide 
space for a transgene inserted in another region (typically for packaging it is desirable 
for the total adenoviral genome to be under 36 kb). 
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In certain embodiments, the necessary adenovirus early genes 
are contained in the chimeric construct of the invention. In other embodiment, one or 
more of the necessary adenovirus early genes can be provided by the packaging host 
cell or in trans. 

5 In general, the chimeric adenovirus of the invention contains 

regulatory sequences from the donating adenovirus serotype, or a transcomplementing 
serotype, to provide the chimeric adenovirus with compatible regulatory proteins. 
Optionally, one or more of the necessary adenoviral structural genes is provided by 
the adenovirus donating its left terminal and its right terminal end. 

10 In certain embodiments, the chimeric adenovirus further 

contains one or more functional adenovirus genes, including, the Endoprotease open 
reading frame, DNA binding protein, 100 kDa scaffolding protein, 33 kDa protein, 
protein VIII, pTP, 52/55 kDa protein, protein VII, Mu and/or protein VI from the 
adenovirus serotype donating its left and right termini. Where all of these genes are 

15 derived from the adenovirus serotype donating the 5' and 3' ITRs, a "pseudotyped" 
virus is formed. In one embodiment, the chimeric adenovirus contains the left end of 
the adenovirus genome from the donating serotype, from the 5' ITR through the end 
of the pol gene (or the pTP). In another embodiment, the chimeric adenovirus 
contains the left end of the donating adenovirus serotype, from the 5 5 ITR through the 

20 penton. In yet another embodiment, the chimeric adenovirus contains the left end of 
the donating adenovirus serotype, e.g., through the end of pTP, but contains an ITR 
from an adenovirus serotype heterologous to the donating adenovirus serotype. Still 
other embodiments will be readily apparent from the present disclosure. 

Optionally, one or more of the genes can be hybrids formed 

25 from the fusion of the donating adenovirus serotype and the parental adenovirus 

serotype providing the capsid proteins {e.g., without limitation, polymerase, terminal 
protein, EQa protein). Suitably, these genes express functional proteins which permit 
packaging of the adenovirus genes into the capsid. Alternatively, one or more of 
these proteins (whether hybrid or non-hybrid) can be functionally deleted in the 

30 chimeric adenovirus. Where desired, any necessary proteins functionally deleted in 
the chimeric adenovirus can be expressed in trans in the packaging cell. 
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B. Parental Adenovirus Structural Proteins 
1 . Serotypes 

This invention is particularly well adapted for use in 
generating chimeric adenoviruses in which the capsid proteins are from a parental 
5 adenovirus which does not efficiently grow in a desirable host cell. The selection of 
the parental adenovirus serotype providing the internal regions can be readily made 
by one of skill in the art based on the information provided herein. 

A variety of suitable adenoviruses can serve as a parental 
adenovirus supplying the regions encoding the structural (ie., capsid proteins). Many 
10 of such adenoviruses can be obtained from the same sources as described above for 
the donating adenovirus serotypes. Examples of suitable parental adenovirus 
serotypes includes, without limitation, human adenovirus serotype 40, among others 
[see, e.g., Horwitz, "Adenoviridae and Their Replication", in VIROLOGY, 2d ed., pp. 
1679-1721 (1990)], and adenoviruses known to infect non-human primates {e.g., 
15 chimpanzees, rhesus, macaque, and other simian species) or other non-human 

mammals, including, without limitation, chimpanzee adenovirus CI, described in US 
Patent No. 6,083,716, which is incorporated by reference; simian adenoviruses, and 
baboon adenoviruses, among others. In addition, the parental adenovirus supplying 
the internal regions may be from a non-naturally occurring adenovirus serotype, such 
20 as may be generated using a variety of techniques known to those of skill in the art. 

In one embodiment illustrated herein, a chimeric virus that was constructed 
was that between the chimpanzee adenoviruses Pan-5 and CI exhibited a higher titer 
in human 293 cells than the wild-type parental virus. However, the invention is not 
limited to the use of these chimpanzee adenoviruses, or to the combination of simian- 
25 simian, human-human, or simian-human chimeric adenoviruses. For example, it may 
be desirable to utilize bovine or canine adenoviruses, or other non-human mammalian 
adenoviruses which do not naturally infect and/or replicate in human cells. 

In the following examples, the human adenovirus type 40 
(Ad40) and the chimpanzee adenovirus CI, simian Pan 5 and Ad40, and Pan 5 and 
30 simian adenovirus SA1 8, are used. However, one of skill in the art will understand 
that other adenoviral serotypes may be readily selected and used in the present 
invention in the place of (or in combination with) these serotypes. 
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2. Sequences 

The parental adenovirus provides to the chimeric construct of 
the invention its internal regions which includes structural proteins necessary for 
generating a capsid having the desired characteristics of the parental adenovirus. 
5 These desired characteristics include, but are not limited to, the ability to infect target 
cells and delivery a heterologous transgene, the ability to elude neutralizing 
antibodies directed to another adenovirus serotype (i.e., avoiding clearance due to 
cross-reactivity), and/or the ability to infect cells in the absence of an immune 
response to the chimeric adenovirus. The advantages of such characteristics may be 

10 most readily apparent in a regimen which involves repeat delivery of adenoviral 
vectors. The left and right termini of the parent adenovirus, including at least the 5' 
ITRs, the El region, the E4 region and the 3' ITRs are non-functional and, preferably, 
completely absent. Optionally, all adenovirus regulatory proteins from this parental 
adenovirus are non-functional and only the structural proteins (or selected structural 

1 5 proteins) are retained. 

At a minimum, the parental adenovirus provides the 
adenoviral late region encoding the hexon protein. Suitably, the parental adenovirus 
further provides the late regions encoding the penton and the fiber. In certain 
embodiments, all of the functional adenoviral late regions, including LI (encoding 

20 52/55 Da, Eta proteins), L2 (encoding penton, VII, V, Mu proteins), L3 (encoding VI, 
hexon, Endoprotease), L4 (encoding 100 kD, 33 kD, VIE proteins) and L5 (encoding 
fiber protein) are supplied by the parental adenovirus. Optionally, one or more of 
these late gene functions, with the exception of those encoding the hexon, penton and 
fiber proteins, can be functionally deleted. Any necessary structural proteins may be 

25 supplied in trans. 

Thus, in certain embodiments, the chimeric adenovirus further 
contains one or more functional adenovirus genes, including, the Endoprotease open 
reading frame, DNA binding protein, 100 kDa scaffolding protein, 33 kDa protein, 
protein VHI, pTP, 52/55 kDa protein, protein VII, Mu and/or protein VI from the 
30 parental adenovirus donating its internal regions. Optionally, one or more of the 

genes can be hybrids formed from the fusion of the donating adenovirus serotype and 
the parental adenovirus serotype providing the capsid proteins, as described above. 
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C. The "Minigene" 

Typically, an adenoviral vector of the invention is designed to contain 
a minigene which may be inserted into the site of a partially deleted, fully deleted 
(absent), or disrupted adenoviral gene. For example, the minigene may be located in 
5 the site of such a functional El deletion or functional E3 deletion, or another suitable 
site. 

The methods employed for the selection of the transgene, the cloning 
and construction of the "minigene" and its insertion into the viral vector are within the 
skill in the art given the teachings provided herein. 

10 1. The transgene 

The transgene is a nucleic acid sequence, heterologous to the 
vector sequences flanking the transgene, which encodes a polypeptide, protein, or 
other product, of interest. The nucleic acid coding sequence is operatively linked to 
regulatory components in a manner which permits transgene transcription, translation, 

1 5 and/or expression in a host cell. 

The composition of the transgene sequence will depend upon 
the use to which the adenoviral vector will be put. For example, the adenoviral vector 
may be used as a helper virus in production of recombinant adeno-associated viruses 
or in production of recombinant adenoviruses deleted of essential adenoviral gene 

20 functions which are supplied by the adenoviral vector, or for a variety of production 
uses. Alternatively, the adenoviral vector may be used for diagnostic purposes. 

One type of transgene sequence includes a reporter sequence, 
which upon expression produces a detectable signal. Such reporter sequences 
include, without limitation, DNA sequences encoding 0-lactamase, P-galactosidase 

25 (LacZ), alkaline phosphatase, thymidine kinase, green fluorescent protein (GFP), 
chloramphenicol acetyltransferase (CAT), luciferase, membrane bound proteins 
including, for example, CD2, CD4, CD8, the influenza hemagglutinin protein, and 
others well known in the art, to which high affinity antibodies directed thereto exist or 
can be produced by conventional means, and fusion proteins comprising a membrane 

30 bound protein appropriately fused to an antigen tag domain from, among others, 
hemagglutinin or Myc. These coding sequences, when associated with regulatory 
elements which drive their expression, provide signals detectable by conventional 
means, including enzymatic, radiographic, colorimetric, fluorescence or other 
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spectrographs assays, fluorescent activating cell sorting assays and immunological 
assays, including enzyme linked immunosorbent assay (ELISA), radioimmunoassay 
(RIA) and immunohistochemistry. For example, where the marker sequence is the 
LacZ gene, the presence of the vector carrying the signal is detected by assays for 
5 beta-galactosidase activity. Where the transgene is GFP or luciferase, the vector 
carrying the signal may be measured visually by color or light production in a 
luminometer. 

However, desirably, the transgene is a non-marker sequence 
encoding a product which is useful in biology and medicine, such as proteins, 
10 peptides, RNA, enzymes, or catalytic RNAs. Desirable RNA molecules include 

tRNA, dsRNA, ribosomal RNA, si RNAs, small hairpin RNAs, trans-splicing RNAs, 
catalytic RNAs, and antisense RNAs. One example of a useful RNA sequence is a 
sequence which extinguishes expression of a targeted nucleic acid sequence in the 
treated animal. 

1 5 The transgene may be used for treatment, e.g. , of genetic 

deficiencies, as a cancer therapeutic or vaccine, for induction of an immune response, 
and/or for prophylactic vaccine purposes. As used herein, induction of an immune 
response refers to the ability of a molecule {e.g. , a gene product) to induce a T cell 
and/or a humoral immune response to the molecule. The invention further includes 

20 using multiple transgenes, e.g. , to correct or ameliorate a condition caused by a multi- 
subunit protein. In certain situations, a different transgene may be used to encode 
each subunit of a protein, or to encode different peptides or proteins. This is desirable 
when the size of the DNA encoding the protein subunit is large, e.g., for an 
immunoglobulin, the platelet-derived growth factor, or a dystrophin protein. In order 

25 for the cell to produce the multi-subunit protein, a cell is infected with the 

recombinant virus containing each of the different subunits. Alternatively, different 
subunits of a protein may be encoded by the same transgene. In this case, a single 
transgene includes the DNA encoding each of the subunits, with the DNA for each 
subunit separated by an internal ribozyme entry site (IRES). This is desirable when 

30 the size of the DNA encoding each of the subunits is small, e.g., the total size of the 
DNA encoding the subunits and the IRES is less than five kilobases. As an alternative 
to an IRES, the DNA may be separated by sequences encoding a 2A peptide, which 
self-cleaves in a post-translational event. See, e.g., M.L. Donnelly, et al 9 J. Gen. 
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tt>o/.,78(Pt 1):13-21 (Jan 1997); Furler, S., etal>Gene Ther., 8(ll):864-873 (June 
2001); Klump H., et aL % Gene Ther.> 8(10):81 1-817 (May 2001). This 2A peptide is 
significantly smaller than an IRES, making it well suited for use when space is a 
limiting factor. However, the selected transgene may encode any biologically active 
5 product or other product, e.g., a product desirable for study. 

Suitable transgenes may be readily selected by one of skill in 
the art. The selection of the transgene is not considered to be a limitation of this 
invention. 

2. Vector and Transgene Regulatory Elements 

10 In addition to the major elements identified above for the 

minigene, the adenoviral vector also includes conventional control elements which 
are operably linked to the transgene in a manner that permits its transcription, 
translation and/or expression in a cell transfected with the plasmid vector or infected 
with the virus produced by the invention. As used herein, "operably linked" 

1 5 sequences include both expression control sequences that are contiguous with the 
gene of interest and expression control sequences that act in trans or at a distance to 
control the gene of interest. 

Expression control sequences include appropriate transcription 
initiation, termination, promoter and enhancer sequences; efficient RNA processing 

20 signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize 
cytoplasmic mRNA; sequences that enhance translation efficiency (i.e. 9 Kozak 
consensus sequence); sequences that enhance protein stability; and when desired, 
sequences that enhance secretion of the encoded product. A great number of 
expression control sequences, including promoters which are native, constitutive, 

25 inducible and/or tissue-specific, are known in the art and may be utilized. 

Examples of constitutive promoters include, without 
limitation, the retroviral Rous sarcoma virus (RS V) LTR promoter (optionally with 
the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV 
enhancer) [see, e.g., Boshart et al 9 Cell, 41:521-530 (1985)], the SV40 promoter, the 

30 dihydrofolate reductase promoter, the p-actin promoter, the phosphoglycerol kinase 
(PGK) promoter, and the EFla promoter [Invitrogen]. 

Inducible promoters allow regulation of gene expression and 
can be regulated by exogenously supplied compounds, environmental factors such as 
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temperature, or the presence of a specific physiological state, e.g., acute phase, a 
particular differentiation state of the cell, or in replicating cells only. Inducible 
promoters and inducible systems are available from a variety of commercial sources, 
including, without limitation, Invitrogen, Clontech and Ariad. Many other systems 
5 have been described and can be readily selected by one of skill in the art. For 

example, inducible promoters include the zinc-inducible sheep metallothionine (MT) 
promoter and the dexamethasone (Dex)-inducible mouse mammary tumor virus 
(MMTV) promoter. Other inducible systems include the T7 polymerase promoter 
system [WO 98/10088]; the ecdysone insect promoter [No et al, Proc. Natl Acad. 

10 Set USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossene/ al, 
Proc, Natl Acad. Set USA, 89:5547-5551 (1992)], the tetracycline-inducible system 
[Gossen et al, Science, 268:1766-1769 (1995), see also Harvey et al, Curr. Opin. 
Chem. Biol, 2:512-518 (1998)]. Other systems include the FK506 dimer, VP16 or 
p65 using castradiol, diphenol murislerone, the RU486-inducible system [Wang et al, 

15 Nat Biotech, 15:239-243 (1997) and Wang et al, Gene Ther., 4:432-441 (1997)] and 
the rapamycin-inducible system [Magari etal, J. Clin. Invest, 100:2865-2872 
(1997)]. The effectiveness of some inducible promoters increases over time. In such 
cases one can enhance the effectiveness of such systems by inserting multiple 
repressors in tandem, e.gl, TetR linked to a TetR by an IRES. Alternatively, one can 

20 wait at least 3 days before screening for the desired function. One can enhance 

expression of desired proteins by known means to enhance the effectiveness of this 
system. For example, using the Woodchuck Hepatitis Virus Posttranscriptional 
Regulatory Element (WPRE). 

In another embodiment, the native promoter for the transgene 

25 will be used. The native promoter may be preferred when it is desired that expression 
of the transgene should mimic the native expression. The native promoter may be 
used when expression of the transgene must be regulated temporally or 
developmental^, or in a tissue-specific manner, or in response to specific 
transcriptional stimuli. In a further embodiment, other native expression control 

30 elements, such as enhancer elements, polyadenylation sites or Kozak consensus 
sequences may also be used to mimic the native expression. 

Another embodiment of the transgene includes a transgene 
operably linked to a tissue-specific promoter. For instance, if expression in skeletal 
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muscle is desired, a promoter active in muscle should be used. These include the 
promoters from genes encoding skeletal (3-actin, myosin light chain 2A, dystrophin, 
muscle creatine kinase, as well as synthetic muscle promoters with activities higher 
than naturally occurring promoters (see Li etal, Nat. Biotech., 17:241-245 (1999)). 
5 Examples of promoters that are tissue-specific are known for liver (albumin, 

Miyatake et al, J. Virol, 71:5124-32 (1997); hepatitis B virus core promoter, Sandig 
et al, Gene Titer., 3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnot et al, Hum. 
Gene Titer., 7:1503-14 (1996)), bone osteocalcin (Stein et al, Mol Biol. Rep., 
24:185-96 (1997)); bone sialoprotein (Chen et al, J. Bone Miner. Res., 11:654-64 

10 (1 996)), lymphocytes (CD2, Hansal et al, J. Immunol , 161 : 1 063-8 (1 998); 

immunoglobulin heavy chain; T cell receptor chain), neuronal such as neuron-specific 
enolase (NSE) promoter (Andersen et al, Cell Mol. Neurobiol, 13:503-15 (1993)), 
neurofilament light-chain gene (Piccioli et al, Proc. Natl Acad. Sci. USA, 88:561 1-5 
(1991)), and the neuron-specific vgf gene (Piccioli et al, Neuron, 15:373-84 (1995)), 

15 among others. 

Optionally, vectors carrying transgenes encoding 
therapeutically useful or immunogenic products may also include selectable markers 
or reporter genes may include sequences encoding geneticin, hygromicin or 
purimycin resistance, among others. Such selectable reporters or marker genes 

20 (preferably located outside the viral genome to be packaged into a viral particle) can 
be used to signal the presence of the plasmids in bacterial cells, such as ampicillin 
resistance. Other components of the vector may include an origin of replication. 
Selection of these and other promoters and vector elements are conventional and 
many such sequences are available [see, e.g., Sambrook et al, and references cited 

25 therein]. 

These vectors are generated using the techniques and 
sequences provided herein, in conjunction with techniques known to those of skill in 
the art. Such techniques include conventional cloning techniques of cDNA such as 
those described in texts [Sambrook et al, Molecular Cloning: A Laboratory Manual, 
30 Cold Spring Harbor Press, Cold Spring Harbor, NY], use of overlapping 

oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and 
any suitable method which provides the desired nucleotide sequence. 
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II. Production of the Recombinant Viral Particle 

In one embodiment, the invention provides a method of generating 
recombinant chimeric adenoviral particles in which the capsid of the chimeric 
adenovirus is of a serotype incapable of efficient growth in the selected host cell. A 
5 vector suitable for production of recombinant chimeric adenoviral particles can be 
generated by direct cloning. Alternatively, such particles can be generated by 
homologous recombination between a first vector containing the left end of the 
chimeric adenoviral genome and a second vector containing the right end of the 
chimeric adenoviral genome. However, any suitable methodology known to those of 

10 skill in the art can be readily utilized to generate a vector suitable to generate a 

production vector, preferably which contains the entire chimeric adenoviral genome, 
including a minigene. This production vector is then introduced into a host cell in 
which the adenoviral capsid protein is assembled and the chimeric adenoviral particle 
assembled as described. 

15 The chimeric adenoviruses of the invention include those in which one or 

more adenoviral genes are absent, or otherwise rendered non-functional. If any of the 
missing gene functions are essential to the replication and infectivity of the adenoviral 
particle, these functions are supplied by a complementation (or transcomplementing) 
cell line or a helper vector expressing these functions during production of the 

20 chimeric adenoviral particle. 

Examples of chimeric adenoviruses containing such missing adenoviral gene 
functions include those which are partially or completely deleted in the El a and/or 
Elb gene. In such a case, the El gene functions can be supplied by the packaging 
host cell, permitting the chimeric construct to be deleted of El gene functions and, if 

25 desired, for a transgene to be inserted in this region. Optionally, the El gene can be 
of a serotype which transcomplements the serotype providing the other adenovirus 
sequences in order to further reduce the possibility of recombination and improve 
safety. In other embodiments, it is desirable to retain an intact Ela and/or Elb region 
in the recombinant adenoviruses. Such an intact El region may be located in its 

30 native location in the adenoviral genome or placed in the site of a deletion in the 
native adenoviral genome (e.g., in the E3 region). 

In another example, all or a portion of the adenovirus delayed early gene E3 
may be eliminated from the chimeric adenovirus. The function of adenovirus E3 is 
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believed to be irrelevant to the function and production of the recombinant virus 
particle. Chimeric adenovirus vectors may also be constructed having a deletion of at 
least the ORF6 region of the E4 gene, and more desirably because of the redundancy 
in the function of this region, the entire E4 region. Still another vector of this 
5 invention contains a deletion in the delayed early gene E2a. Similarly, deletions in 
the intermediate genes DC and rVa 2 may be useful for some purposes. Optionally, 
deletions may also be made in selected portions of the late genes LI through L5, as 
described above. 

Other deletions may be made in the other structural or non-structural 

10 adenovirus genes. The above-discussed deletions may be used individually, i.e., an 
adenovirus sequence for use in the present invention may contain deletions in only a 
single region. Alternatively, deletions of entire genes or portions thereof effective to 
destroy their biological activity may be used in any combination. For example, in one 
exemplary vector, the adenovirus sequence may have deletions of the El genes and 

15 the E4 gene, or of the El, E2a and E3 genes, or of the El and E3 genes, or of El, E2a 
and E4 genes, with or without deletion of E3, and so on. As discussed above, such 
deletions may be used in combination with other mutations, such as temperature- 
sensitive mutations, to achieve a desired result. 

Examples of suitable transcomplementing serotypes are provided above. The 

20 use of transcomplementing serotypes can be particularly advantageous where there is 
diversity between the Ad sequences in the vector of the invention and the human 
AdEl sequences found in currently available packaging cells. In such cases, the use 
of the current human El-containing cells prevents the generation of replication- 
competent adenoviruses during the replication and production process. However, in 

25 certain circumstances, it will be desirable to utilize a cell line which expresses the El 
gene products can be utilized for production of an El -deleted simian adenovirus. 
Such cell lines have been described. See, e.g., US Patent 6,083,716. 
A. Packaging Host Cells 

Suitably, the packaging host cell is selected from among cells in 

30 which the adenovirus serotype donating the left and right terminal ends of the 

chimeric genome are capable of efficient growth. The host cells are preferably of 
mammalian origin, and most preferably are of non-human primate or human origin. 
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Particularly desirable host cells are selected from among any 
mammalian species, including, without limitation, cells such as A549 [ATCC 
Accession No. CCL 185], 911 cells, WEHI, 3T3, 10T1/2, HEK 293 cells or PERC6 
(both of which express functional adenoviral El) [Fallaux, FJ et al, (1998), Hum 
5 Gene Ther, 9:19094917], Saos, C2C12, L cells, HT1080, HepG2, HeLa [ATCC 
Accession No. CCL 2], KB [CCL 1 7], Detroit [e.g. , Detroit 5 1 0, CCL 72] and WI-3 8 
[CCL 75] cells, and primary fibroblast, hepatocyte and myoblast cells derived from 
mammals including human, monkey, mouse, rat, rabbit, and hamster. These cell lines 
are all available from the American Type Culture Collection, 10801 University 

10 Boulevard, Manassas, Virginia 20110-2209. Other suitable cell lines may be 

obtained from other sources. The selection of the mammalian species providing the 
cells is not a limitation of this invention; nor is the type of mammalian cell, i.e., 
fibroblast, hepatocyte, tumor cell, etc. 

As described above, a chimeric adenovirus of the invention can lack 

1 5 one or more functional adenoviral regulatory and/or structural genes which are 
supplied either by the host cell or in trans to effect packaging of the chimeric 
adenovirus into the viral capsid to generate the viral particle. Thus, the ability of a 
selected host cell to supply transcomplementing adenoviral sequences may be taken 
into consideration in selecting a desired host cell. 

20 In one example, the cells are from a stable cell line which expresses 

adenovirus Ela and Elb functions from a cell line which transcomplements the 
adenovirus serotype which donates the left and ri^ht termini to the chimera of the 
invention, permitting the chimera to be El -deleted. Alternatively, where the cell line 
does not transcomplementthe adenovirus donating the termini, El functions may be 

25 provided by the chimera, or in trans. 

If desired, one may utilize the sequences provided herein to generate a 
packaging cell or cell line that expresses, at a minimum, the adenovirus El gene from 
the adenovirus serotype donating the 5' ITR under the transcriptional control of a 
promoter for expression, or a transcomplementing serotype, in a selected parent cell 

30 line. Inducible or constitutive promoters may be employed for this purpose. 

Examples of such promoters are described in detail elsewhere in this specification. A 
parent cell is selected for the generation of a novel cell line expressing any desired 
adenovirus or adenovirus gene, including, e.g., a human Ad5, AdPan5, Pan6, Pan7, 
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SV1, SV25 or SV39 gene. Without limitation, such a parent cell line may be HeLa 
[ATCC Accession No. CCL 2], A549 [ATCC Accession No. CCL 185], HEK 293, 
KB [CCL 17], Detroit [e.g., Detroit 510, CCL 72] and WI-38 [CCL 75] cells, among 
others. Many of these cell lines are all available from the ATCC. Other suitable 
5 parent cell lines may be obtained from other sources. 

Such El -expressing cell lines are useful in the generation of chimeric 
adenovirus El deleted vectors. Additionally, or alternatively, the invention provides 
cell lines that express one or more simian adenoviral gene products, e.g., Ela, Elb, 
E2a, and/or E4 ORF6, can be constructed using essentially the same procedures for 

10 use in the generation of chimeric viral vectors. Such cell lines can be utilized to 

transcomplement adenovirus vectors deleted in the essential genes that encode those 
products, or to provide helper functions necessary for packaging of a helper- 
dependent virus (e.g., adeno-associated virus). The preparation of a host cell 
according to this invention involves techniques such as assembly of selected DNA 

1 5 sequences. This assembly may be accomplished utilizing conventional techniques. 
Such techniques include cDNA and genomic cloning, which are well known and are 
described in Sambrook et al., cited above, use of overlapping oligonucleotide 
sequences of the adenovirus genomes, combined with polymerase chain reaction, 
synthetic methods, and any other suitable methods which provide the desired 

20 nucleotide sequence. 

In still another alternative, the essential adenoviral gene products are 
provided in trans by the adenoviral vector and/or helper virus. In such an instance, a 
suitable host cell can be selected from any biological organism, including prokaryotic 
(e.g., bacterial) cells, and eukaryotic cells, including, insect cells, yeast cells and 

25 mammalian cells. Particularly desirable host cells are selected from among any 
mammalian species, including, without limitation, cells such as A549, WEHI, 3T3, 
10T1/2, HEK 293 cells or PERC6 (both of which express functional adenoviral El) 
[Fallaux, FJ et al, (1998), Hum Gene Ther, 9:1909-1917], Saos, C2C12, L cells, 
HT1080, HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from 

30 mammals including human, monkey, mouse, rat, rabbit, and hamster. The selection 
of the mammalian species providing the cells is not a limitation of this invention; nor 
is the type of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc. 
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B. Helper Vectors 

Thus, depending upon the adenovirus gene content of the adenoviral 
vectors and any adenoviral gene functions expressed from the host cell, a helper 
vector may be necessary to provide sufficient adenovirus gene sequences necessary to 
5 produce an infective recombinant viral particle containing the minigene. See, for 
example, the techniques described for preparation of a "minimal" human Ad vector in 
International Patent Application W096/13597, published May 9, 1996, and 
incorporated herein by reference. Suitably, these helper vectors may be non- 
replicating genetic elements, a plasmid, or a virus. 

10 Useful helper vectors contain selected adenovirus gene sequences not 

present in the adenovirus vector construct and/or not expressed by the packaging cell 
line in which the vector is transfected. In one embodiment, the helper virus is 
replication-defective and contains a variety of adenovirus genes in addition to the 
sequences described above. Such a helper vector is desirably used in combination 

1 5 with an E 1 -expressing cell line. 

Helper vectors may be formed into poly-cation conjugates as 
described in Wu et al, J. Biol Chem., 264:16985-16987 (1989); K. J. Fisher and J. M. 
Wilson, Biochem. J., 299:49 (April 1, 1994). A helper vector may optionally contain 
a second reporter minigene. A number of such reporter genes are known to the art. 

20 The presence of a reporter gene on the helper virus which is different from the 

transgene on the adenovirus vector allows both the Ad vector and the helper vector to 
be independently monitored. This second reporter is used to enable separation 
between the resulting recombinant virus and the helper virus upon purification. 
C. Assembly of Viral Particle and Transfection of a Cell Line 

25 Generally, when delivering the vector comprising the minigene by 

transfection, the vector is delivered in an amount from about 5 jxg to about 100 jxg 
DNA, and preferably about 10 to about 50 \xg DNA to about 1 x 10 4 cells to about 1 x 
10 13 cells, and preferably about 10 5 cells. However, the relative amounts of vector 
DNA to host cells may be adjusted, taking into consideration such factors as the 

30 selected vector, the delivery method and the host cells selected. 

Introduction into the host cell of the vector may be achieved by any 
means known in the art or as disclosed above, including transfection, and infection. 
One or more of the adenoviral genes may be stably integrated into the genome of the 
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host cell, stably expressed as episomes, or expressed transiently. The gene products 
may all be expressed transiently, on an episome or stably integrated, or some of the 
gene products may be expressed stably while others are expressed transiently. 

Furthermore, the promoters for each of the adenoviral genes may be 
5 selected independently from a constitutive promoter, an inducible promoter or a 
native adenoviral promoter. The promoters may be regulated by a specific 
physiological state of the organism or cell (i.e., by the differentiation state or in 
replicating or quiescent cells) or by exogenously added factors, for example. 

Introduction of the molecules (as plasmids or viruses) into the host 

1 0 cell may also be accomplished using techniques known to the skilled artisan and as 
discussed throughout the specification. In preferred embodiment, standard 
transfection techniques are used, e.g., CaP0 4 transfection or electroporation. 

Assembly of the selected DNA sequences of the adenovirus (as well 
as the transgene and other vector elements) into various intermediate plasmids, and 

15 the use of the plasmids and vectors to produce a recombinant viral particle are all 
achieved using conventional techniques. Such techniques include direct cloning as 
described [G. Gao et al, Gene Ther. 2003 Oct; 10(22): 1926-1930; US Patent 
Publication No. 2003-00921 61 -A, published May 15, 2003; International Patent 
Application No. PCT/US03/12405]. Other cloning techniques of cDNA such as 

20 those described in texts [Sambrook et al, cited above], use of overlapping 

oligonucleotide sequences of the adenovirus genomes, polymerase chain reaction, and 
any suitable method which provides the desired nucleotide sequence can be utilized. 
Standard transfection and co-transfection techniques are employed, e.g., CaP04 
precipitation techniques. Other conventional methods employed include homologous 

25 recombination of the viral genomes, plaquing of viruses in agar overlay, methods of 
measuring signal generation, and the like. 

For example, following the construction and assembly of the desired 
minigene-containing viral vector, the vector is transfected in vitro in the presence of 
an optional helper vector into the packaging cell line. The functions expressed from 

30 the plasmid, packaging cell line and helper virus, if any, permits the adenovirus- 
transgene sequences in the vector to be replicated and packaged into virion capsids, 
resulting in the chimeric viral particles. The current method for producing such virus 
particles is transfection-based. However, the invention is not limited to such 
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10 



methods. The resulting chimeric adenoviruses are useful in transferring a selected 
transgene to a selected cell. 

III. Use of the Chimeric Adenovirus Vectors 

The chimeric adenovirus vectors of the invention are useful for gene transfer 
to a human or veterinary subject (including, non-human primates, non-simian 
primates, and other mammals) in vitro, ex vivo, and in vivo. 

The recombinant adenovirus vectors described herein can be used as 
expression vectors for the production of the products encoded by the heterologous 
genes in vitro. For example, the recombinant adenoviruses containing a gene inserted 
into the location of an El deletion may be transfected into an El-expressing cell line 
as described above. Alternatively, replication-competent adenoviruses may be used 
in another selected cell line. The transfected cells are then cultured in the 
conventional manner, allowing the recombinant adenovirus to express the gene 
1 5 product from the promoter. The gene product may then be recovered from the culture 
medium by known conventional methods of protein isolation and recovery from 
culture. 

A chimeric adenoviral vector of the invention provides an efficient gene 
transfer vehicle that can deliver a selected transgene to a selected host cell in vivo or 

20 ex vivo even where the organism has neutralizing antibodies to one or more AAV 
serotypes. In one embodiment, the rAd and the cells are mixed ex vivo; the infected 
cells are cultured using conventional methodologies; and the transduced cells are re- 
infused into the patient. These compositions are particularly well suited to gene 
delivery for therapeutic purposes and for immunization, including inducing protective 

25 immunity. 

More commonly, the chimeric adenoviral vectors of the invention will be 
utilized for delivery of therapeutic or immunogenic molecules, as described below. It 
will be readily understood for both applications that the recombinant adenoviral 
vectors of the invention are particularly well suited for use in regimens involving 
30 repeat delivery of recombinant adenoviral vectors. Such regimens typically involve 
delivery of a series of viral vectors in which the viral capsids are alternated. The viral 
capsids may be changed for each subsequent administration, or after a pre-selected 
number of administrations of a particular serotype capsid (e.g., one, two, three, four 
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or more). Thus, a regimen may involve delivery of a rAd with a first capsid, delivery 
with a rAd with a second capsid, and delivery with a third capsid. A variety of other 
regimens which use the Ad capsids of the invention alone, in combination with one 
another, or in combination with other Ad serotypes will be apparent to those of skill 
5 in the art. Optionally, such a regimen may involve administration of rAd with capsids 
of non-human primate adenoviruses, human adenoviruses, or artificial {e.g., chimeric) 
serotypes such as are described herein. Each phase of the regimen may involve 
administration of a series of injections (or other delivery routes) with a single Ad 
serotype capsid followed by a series with another Ad serotype capsid. Alternatively, 

1 0 the recombinant Ad vectors of the invention may be utilized in regimens involving 
other non-adenoviral-mediated delivery systems, including other viral systems, non- 
viral delivery systems, protein, peptides, and other biologically active molecules. 

The following sections will focus on exemplary molecules which may be 
delivered via the adenoviral vectors of the invention. 

15 A. Ad-Mediated Delivery of Therapeutic Molecules 

In one embodiment, the Ad vectors described herein are administered 
to humans according to published methods for gene therapy. A viral vector of the 
invention bearing the selected transgene may be administered to a patient, preferably 
suspended in a biologically compatible solution or pharmaceutically acceptable 

20 delivery vehicle. A suitable vehicle includes sterile saline. Other aqueous and non- 
aqueous isotonic sterile injection solutions and aqueous and non-aqueous sterile 
suspensions known to be pharmaceutically acceptable carriers and well known to 
those of skill in the art may be employed for this purpose. 

The adenoviral vectors are administered in sufficient amounts to 

25 transduce the target cells and to provide sufficient levels of gene transfer and 

expression to provide a therapeutic benefit without undue adverse or with medically 
acceptable physiological effects, which can be determined by those skilled in the 
medical arts. Conventional and pharmaceutically acceptable routes of administration 
include, but are not limited to, direct delivery to the retina and other intraocular 

30 delivery methods, direct delivery to the liver, inhalation, intranasal, intravenous, 
intramuscular, intratracheal, subcutaneous, intradermal, rectal, oral and other 
parenteral routes of administration. Routes of administration may be combined, if 
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desired, or adjusted depending upon the transgene or the condition. The route of 
administration primarily will depend on the nature of the condition being treated. 

Dosages of the viral vector will depend primarily on factors such as 
the condition being treated, the age, weight and health of the patient, and may thus 
5 vary among patients. For example, a therapeutically effective adult human or 

veterinary dosage of the viral vector is generally in the range of from about 100 to 
about 100 mL of a carrier containing concentrations of from about 1 x 10 6 to about 1 
x 10 15 particles, about 1 x 10 11 to 1 x 10 13 particles, or about 1 x 10 9 to lx 10 12 
particles. Dosages will range depending upon the size of the animal and the route of 

10 administration. For example, a suitable human or veterinary dosage (for about an 80 
kg animal) for intramuscular injection is in the range of about 1 x 10 9 to about 5 x 
10 12 particles per mL, for a single site. Optionally, multiple sites of administration 
may be delivered. In another example, a suitable human or veterinary dosage may be 
in the range of about 1 x 10 u to about 1 x 10 15 particles for an oral formulation. One 

1 5 of skill in the art may adjust these doses, depending the route of administration, and 
the therapeutic or vaccinal application for which the recombinant vector is employed. 
The levels of expression of the transgene, or for an immunogen, the level of 
circulating antibody, can be monitored to determine the frequency of dosage 
administration. Yet other methods for determining the timing of frequency of 

20 administration will be readily apparent to one of skill in the art. 

An optional method step involves the co-administration to the 
patient, either concurrently with, or before or after administration of the viral vector, 
of a suitable amount of a short acting immune modulator. The selected immune 
modulator is defined herein as an agent capable of inhibiting the formation of 

25 neutralizing antibodies directed against the recombinant vector of this invention or 
capable of inhibiting cytolytic T lymphocyte (CTL) elimination of the vector. The 
immune modulator may interfere with the interactions between the T helper subsets 
(T H i or Th2) and B cells to inhibit neutralizing antibody formation. Alternatively, the 
immune modulator may inhibit the interaction between T H i cells and CTLs to reduce 

30 the occurrence of CTL elimination of the vector. A variety of useful immune 

modulators and dosages for use of same are disclosed, for example, in Yang et al, J, 
Virol, 70(9) (Sept 1996); International Patent Application No. WO96/12406, 
published May 2, 1996; and International Patent Application No.PCT/US96/03035, 
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all incorporated herein by reference. Typically, such immune modulators would be 
selected when the transgene is a therapeutic which requires repeat delivery. 
1 . Therapeutic Transgenes 

Useful therapeutic products encoded by the transgene include 
5 hormones and growth and differentiation factors including, without limitation, insulin, 
glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone 
releasing factor (GRF), follicle stimulating hormone (FSH), luteinizing hormone 
(LH), human chorionic gonadotropin (hCG), vascular endothelial growth factor 
(VEGF), angiopoietins, angiostatin, granulocyte colony stimulating factor (GCSF), 

1 0 erythropoietin (EPO), connective tissue growth factor (CTGF), basic fibroblast 
growth factor (bFGF), acidic fibroblast growth factor (aFGF), epidermal growth 
factor (EGF), transforming growth factor a (TGF a), platelet-derived growth factor 
(PDGF), insulin growth factors I and II (IGF-I and IGF-II), any one of the 
transforming growth factor superfamily, including TGF, activins, inhibins, or any of 

1 5 the bone morphogenic proteins (BMP) BMPs 1 - 1 5, any one of the 

heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors, 
nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins 
NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived 
neurotrophic factor (GDNF), neurturin, agrin, any one of the family of 

20 semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF), 
ephrins, noggin, sonic hedgehog and tyrosine hydroxylase. 

Other useful transgene products include proteins that regulate the 
immune system including, without limitation, cytokines and lymphokines such as 
thrombopoietin (TPO), interleukins (DL) DL-1 through DL-25 (including, e.g., IL-2, IL- 

25 4, IL-12 and IL-18), monocyte chemoattractant protein, leukemia inhibitory factor, 
granulocyte-macrophage colony stimulating factor, Fas ligand, tumor necrosis factors 
and, interferons, and, stem cell factor, flk-2/flt3 ligand. Gene products produced by 
the immune system are also useful in the invention. These include, without 
limitation, immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, 

30 humanized antibodies, single chain antibodies, T cell receptors, chimeric T cell 

receptors, single chain T cell receptors, class I and class II MHC molecules, as well as 
engineered immunoglobulins and MHC molecules. Useful gene products also include 
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complement regulatory proteins such as complement regulatory proteins, membrane 
cofactor protein (MCP), decay accelerating factor (DAF), CR1, CF2 and CD59. 

Still other useful gene products include any one of the receptors for 
the hormones, growth factors, cytokines, lymphokines, regulatory proteins and 
5 immune system proteins. The invention encompasses receptors for cholesterol 
regulation, including the low density lipoprotein (LDL) receptor, high density 
lipoprotein (HDL) receptor, the very low density lipoprotein (VLDL) receptor, 
proteins useful in the regulation of lipids, including, e.g., apolipoprotein (apo) A and 
its isoforms (e.g., ApoAI), apoE and its isoforms including E2, E3 and E4), SRB1, 

10 ABC1, and the scavenger receptor. The invention also encompasses gene products 
such as members of the steroid hormone receptor superfamily including 
glucocorticoid receptors and estrogen receptors, Vitamin D receptors and other 
nuclear receptors. In addition, useful gene products include transcription factors such 
as jun, fos, max, mad, serum response factor (SRF), AP-1, AP2, myb, MyoD and 

15 myogenin, ETS-box containing proteins, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, 
ZF5, NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-box binding proteins, interferon 
regulation factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT, GATA- 
box binding proteins, e.g., GATA-3, and the forkhead family of winged helix 
proteins. 

20 Other useful gene products include, carbamoyl synthetase I, ornithine 

transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase, 
fiimarylacetacetate hydrolase, phenylalanine hydroxylase, alpha- 1 antitrypsin, 
glucose-6-phosphatase, porphobilinogen deaminase, cystathione beta-synthase, 
branched chain ketoacid decarboxylase, albumin, isovaleryl-coA dehydrogenase, 

25 propionyl CoA carboxylase, methyl malonyl CoA mutase, glutaryl CoA 
dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic 
phosphorylase, phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, a 
cystic fibrosis transmembrane regulator (CFTR) sequence, and a dystrophin cDNA 
sequence. Other useful gene products include those useful for treatment of 

30 hemophilia A (e.g. , Factor VIII and its variants, including the light chain and heavy 
chain of the heterodimer, optionally operably linked by a junction), and the B-domain 
deleted Factor Vm, see US 6,200,560 and 6,221,349], and useful for treatment of 
hemophilia B (e.g, Factor IX). 
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Still other useful gene products include non-naturally occurring 
polypeptides, such as chimeric or hybrid polypeptides having a non-naturally 
occurring amino acid sequence containing insertions, deletions or amino acid 
substitutions. For example, single-chain engineered immunoglobulins could be useful 
5 in certain immunocompromised patients. Other types of non-naturally occurring gene 
sequences include antisense molecules and catalytic nucleic acids, such as ribozymes, 
which could be used to reduce overexpression of a target 

Reduction and/or modulation of expression of a gene are particularly 
desirable for treatment of hyperproliferative conditions characterized by 

10 hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include 
those polypeptides which are produced exclusively or at higher levels in 
hyperproliferative cells as compared to normal cells. Target antigens include 
polypeptides encoded by oncogenes such as myb, myc, fyn, and the translocation 
gene bcr/abl, ras, src, P53, neu, trk and EGRF. In addition to oncogene products as 

1 5 target antigens, target polypeptides for anti-cancer treatments and protective regimens 
include variable regions of antibodies made by B cell lymphomas and variable 
regions of T cell receptors of T cell lymphomas which, in some embodiments, are 
also used as target antigens for autoimmune disease. Other tumor-associated 
polypeptides can be used as target polypeptides such as polypeptides which are found 

20 at higher levels in tumor cells including the polypeptide recognized by monoclonal 
antibody 17-1 A and folate binding polypeptides. 

Other suitable therapeutic polypeptides and proteins include those 
which may be useful for treating individuals suffering from autoimmune diseases and 
disorders by conferring a broad based protective immune response against targets that 

25 are associated with autoimmunity including cell receptors and cells which produce 
self-directed antibodies. T-cell mediated autoimmune diseases include rheumatoid 
arthritis (RA), multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin 
dependent diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis, 
ankylosing spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, 

30 vasculitis, Wegener's granulomatosis, Crohn's disease and ulcerative colitis. Each of 
these diseases is characterized by T cell receptors (TCRs) that bind to endogenous 
antigens and initiate the inflammatory cascade associated with autoimmune diseases. 
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The chimeric adenoviral vectors of the invention are particularly well 
suited for therapeutic regimens in which multiple adenoviral-mediated deliveries of 
transgenes is desired, e.g., in regimens involving redelivery of the same transgene or 
in combination regimens involving delivery of other transgenes. Such regimens may 
5 involve administration of a chimeric adenoviral vector, followed by re-administration 
with a vector from the same serotype adenovirus. Particularly desirable regimens 
involve administration of a chimeric adenoviral vector of the invention, in which the 
serotype of the viral vector delivered in the first administration differs from the 
serotype of the viral vector utilized in one or more of the subsequent administrations. 
10 For example, a therapeutic regimen involves administration of a chimeric vector and 
repeat administration with one or more adenoviral vectors of the same or different 
serotypes. In another example, a therapeutic regimen involves administration of an 
adenoviral vector followed by repeat administration with a chimeric vector of the 
invention which differs from the serotype of the first delivered adenoviral vector, and 
15 optionally further administration with another vector which is the same or, preferably, 
differs from the serotype of the vector in the prior administration steps. These 
regimens are not limited to delivery of adenoviral vectors constructed using the 
chimeric serotypes of the invention. Rather, these regimens can readily utilize 
chimeric or non-chimeric vectors of other adenoviral serotypes, which may be of 
20 artificial, human or non-human primate, or other mammalian sources, in combination 
with one or more of the chimeric vectors of the invention. Examples of such 
serotypes are discussed elsewhere in this document. Further, these therapeutic 
regimens may involve either simultaneous or sequential delivery of chimeric 
adenoviral vectors of the invention in combination with non-adenoviral vectors, non- 
25 viral vectors, and/or a variety of other therapeutically useful compounds or molecules. 
The present invention is not limited to these therapeutic regimens, a variety of which 
will be readily apparent to one of skill in the art. 

B. Ad-Mediated Delivery of Immunogenic Transgenes 
The adenoviruses of the invention may also be employed as 
30 immunogenic compositions. As used herein, an immunogenic composition is a 

composition to which a humoral (e.g., antibody) or cellular (e.g., a cytotoxic T cell) 
response is mounted to a transgene product delivered by the immunogenic 
composition following delivery to a mammal, and preferably a primate. The present 
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10 



20 



25 



invention provides an Ad that can contain in any of its adenovirus sequence deletions 
a gene encoding a desired immunogen. Chimeric adenoviruses based on simian or 
other non-human mammalian primate serotypes are likely to be better suited for use 
as a live recombinant virus vaccine in different animal species compared to an 
adenovirus of human origin, but is not limited to such a use. The recombinant 
adenoviruses can be used as prophylactic or therapeutic vaccines against any 
pathogen for which the antigen(s) crucial for induction of an immune response and 
able to limit the spread of the pathogen has been identified and for which the cDNA is 
available. 



Such vaccinal (or other immunogenic) compositions are formulated in 
a suitable delivery vehicle, as described above. Generally, doses for the 
immunogenic compositions are in the range defined above for therapeutic 
compositions. The levels of immunity of the selected gene can be monitored to 
determine the need, if any, for boosters. Following an assessment of antibody titers in 
1 5 the serum, optional booster immunizations may be desired. 

Optionally, a vaccinal composition of the invention may be formulated 
to contain other components, including, e.g. adjuvants, stabilizers, pH adjusters, 
preservatives and the like. Such components are well known to those of skill in the 
vaccine art. Examples of suitable adjuvants include, without limitation, liposomes, 
alum, monophosphoryl lipid A, and any biologically active factor, such as cytokine, 
an interleukin, a chemokine, a ligands, and optimally combinations thereof. Certain of 
these biologically active factors can be expressed in vrvo, e.g., via a plasmid or viral 
vector. For example, such an adjuvant can be administered with a priming DNA 
vaccine encoding an antigen to enhance the antigen-specific immune response 
compared with the immune response generated upon priming with a DNA vaccine 
encoding the antigen only. 

The adenoviruses are administered in "an immunogenic amount", that 
is, an amount of adenovirus that is effective in a route of administration to transfect 
the desired cells and provide sufficient levels of expression of the selected gene to 
30 induce an immune response. Where protective immunity is provided, the 

recombinant adenoviruses are considered to be vaccine compositions useful in 
preventing infection and/or recurrent disease. 
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Alternatively, or in addition, the vectors of the invention may contain 
a transgene encoding a peptide, polypeptide or protein which induces an immune 
response to a selected immunogen. The recombinant adenoviruses of this invention 
are expected to be highly efficacious at inducing cytolytic T cells and antibodies to 
5 the inserted heterologous antigenic protein expressed by the vector. 

For example, immunogens may be selected from a variety of viral 
families. Example of desirable viral families against which an immune response 
would be desirable include, the picornavirus family, which includes the genera 
rhinoviruses, which are responsible for about 50% of cases of the common cold; the 

10 genera enteroviruses, which include polioviruses, coxsackieviruses, echoviruses, and 
human enteroviruses such as hepatitis A virus; and the genera apthoviruses, which are 
responsible for foot and mouth diseases, primarily in non-human animals. Within the 
picornavirus family of viruses, target antigens include the VP 1, VP2, VP3, VP4, and 
VPG. Another viral family includes the calcivirus family, which encompasses the 

1 5 Norwalk group of viruses, which are an important causative agent of epidemic 

gastroenteritis. Still another viral family desirable for use in targeting antigens for 
inducing immune responses in humans and non-human animals is the togavirus 
family, which includes the genera alphavirus, which include Sindbis viruses, 
RossRiver virus, and Venezuelan, Eastern & Western Equine encephalitis, and 

20 rubivirus, including Rubella virus. The flaviviridae family includes dengue, yellow 
fever, Japanese encephalitis, St. Louis encephalitis and tick borne encephalitis 
viruses. Other target antigens may be generated from the Hepatitis C or the 
coronavirus family, which includes a number of non-human viruses such as infectious 
bronchitis virus (poultry), porcine transmissible gastroenteric virus (pig), porcine 

25 hemagglutinin encephalomyelitis virus (pig), feline infectious peritonitis virus 
(cats), feline enteric coronavirus (cat), canine coronavirus (dog), and human 
respiratory coronaviruses, which may cause the common cold and/or non-A, B or C 
hepatitis. In addition, the human coronaviruses include the putative causative agent 
of sudden acute respiratory syndrome (SARS). Within the coronavirus family, target 

30 antigens include the El (also called M or matrix protein), E2 (also called S or Spike 
protein), E3 (also called HE or hemagglutin-elterose) glycoprotein (not present in all 
coronaviruses), or N (nucleocapsid). Still other antigens may be targeted against the 
rhabdovirus family, which includes the genera vesiculovirus (e.g., Vesicular 
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Stomatitis Virus), and the general lyssavirus (e.g., rabies). Within the rhabdo virus 
family, suitable antigens may be derived from the G protein or the N protein. The 
family filoviridae, which includes hemorrhagic fever viruses such as Marburg and 
Ebola virus, may be a suitable source of antigens. The paramyxovirus family 
5 includes parainfluenza Virus Type 1, parainfluenza Virus Type 3, bovine 

parainfluenza Virus Type 3, rubulavirus (mumps virus), parainfluenza Virus Type 2, 
parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest, 
morbillivirus, which includes measles and canine distemper, and pneumovirus, which 
includes respiratory syncytial virus. The influenza virus is classified within the 

10 family orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the 
Nl protein). The bunyavirus family includes the genera bunyavirus (California 
encephalitis, La Crosse), phlebovirus (Rift Valley Fever), hantavirus (puremala is a 
hemahagin fever virus), nairovirus (Nairobi sheep disease) and various unassigned 
bungaviruses. The arenavirus family provides a source of antigens against LCM and 

15 Lassa fever virus. The reovirus family includes the genera reovirus, rotavirus (which 
causes acute gastroenteritis in children), orbiviruses, and cultivirus (Colorado Tick 
fever), Lebombo (humans), equine encephalosis, blue tongue. 

The retrovirus family includes the sub-family oncorivirinal which 
encompasses such human and veterinary diseases as feline leukemia virus, HTLVI 

20 and HTLVII, lentivirinal (which includes human immunodeficiency virus (HTV), 
simian immunodeficiency virus (SIV), feline immunodeficiency virus (FIV), equine 
infectious anemia virus, and spumavirinal). Among the lentiviruses, many suitable 
antigens have been described and can readily be selected. Examples of suitable HTV 
and SIV antigens include, without limitation the gag, pol, Vif, Vpx, VPR, Env, Tat, 

25 Nef, and Rev proteins, as well as various fragments thereof. For example, suitable 
fragments of the Env protein may include any of its subunits such as the gpl20, 
gpl60, gp41, or smaller fragments thereof, e.g., of at least about 8 amino acids in 
length. Similarly, fragments of the tat protein may be selected. [See, US Patent 
5,891,994 and US Patent 6,193,981.] See, also, the HIV and SIV proteins described 

30 in D.H. Barouch et al, J. Virol, 75(5):2462-2467 (March 2001), and R.R. Amara, et 
al, Science, 292:69-7 r 4 (6 April 2001). In another example, the HTV and/or SIV 
immunogenic proteins or peptides may be used to form fusion proteins or other 
immunogenic molecules. See, e.g., the HIV-1 Tat and/or Nef fusion proteins and 
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immunization regimens described in WO 01/54719, published August 2, 2001, and 
WO 99/16884, published April 8, 1999. The invention is not limited to the HIV 
and/or SIV immunogenic proteins or peptides described herein. In addition, a variety 
of modifications to these proteins have been described or could readily be made by 
5 one of skill in the art. See, e.g., the modified gag protein that is described in US 
Patent 5,972,596. Further, any desired HTV and/or SIV immunogens may be 
delivered alone or in combination. Such combinations may include expression from a 
single vector or from multiple vectors. Optionally, another combination may involve 
delivery of one or more expressed immunogens with delivery of one or more of the 

10 immunogens in protein form. Such combinations are discussed in more detail below. 

The papovavirus family includes the sub-family polyomaviruses 
(BKU and JCU viruses) and the sub-family papillomavirus (associated with cancers 
or malignant progression of papilloma). The adenovirus family includes viruses (EX, 
AD7, ARD, O.B.) which cause respiratory disease and/or enteritis. The parvovirus 

15 includes family feline parvovirus (feline enteritis), feline panleucopeniavirus, canine 
parvovirus, and porcine parvovirus. The herpesvirus family includes the sub-family 
alphaherpesviiinae, which encompasses the genera simplexvirus (HSVI, HSVII), 
varicellovirus (pseudorabies, varicella zoster) and the sub-family betaherpesvirinae, 
which includes the genera cytomegalovirus (HCMV, muromegalovirus) and the 

20 sub-family gammaherpesvirinae, which includes the genera lymphocryptovirus, EB V 
(Burkitts lymphoma), infectious rhinotracheitis, Marek's disease virus, and 
rhadinovirus. The poxvirus family includes the sub-family chordopoxvirinae, which 
encompasses the genera orthopoxvirus (Variola (Smallpox) and Vaccinia (Cowpox)), 
parapoxvirus, avipoxvirus, capripoxvirus, leporipoxvirus, suipoxvirus, and the 

25 sub-family entomopoxvirinae. The hepadnavirus family includes the Hepatitis B 
virus. One unclassified virus which may be suitable source of antigens is the 
Hepatitis delta virus. Still other viral sources may include avian infectious bursal 
disease virus and porcine respiratory and reproductive syndrome virus. The 
alphavirus family includes equine arteritis virus and various Encephalitis viruses. 

30 The viruses of the present invention may also carry immunogens 

which are useful to immunize a human or non-human animal against other pathogens 
including bacteria, fungi, parasitic microorganisms or multicellular parasites which 
infect human and non-human vertebrates, or from a cancer cell or tumor cell. 
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Examples of bacterial pathogens include pathogenic gram-positive cocci include 
pneumococci; staphylococci; and streptococci. Pathogenic gram-negative cocci 
include meningococcus; gonococcus. Pathogenic enteric gram-negative bacilli 
include enterobacteriaceae; pseudomonas, acinetobacteria and eikenella; melioidosis; 
5 salmonella; shigella; haemophilus; moraxella; H. ducreyi (which causes chancroid); 
brucella; Franisella tularensis (which causes tularemia); yersinia (pasteurella); 
streptobacillus moniliformis and spirillum; Gram-positive bacilli include listeria 
monocytogenes; erysipelothrix rhusiopathiae; Corynebacterium diphtheria 
(diphtheria); cholera; B. anthracis (anthrax); donovanosis (granuloma inguinale); and 

10 bartonellosis. Diseases caused by pathogenic anaerobic bacteria include tetanus; 

botulism; other Clostridia; tuberculosis; leprosy; and other mycobacteria. Pathogenic 
spirochetal diseases include syphilis; treponematoses: yaws, pinta and endemic 
syphilis; and leptospirosis. Other infections caused by higher pathogen bacteria and 
pathogenic fungi include actinomycosis; nocardiosis; cryptococcosis, blastomycosis, 

15 histoplasmosis and coccidioidomycosis; candidiasis, aspergillosis, and mucormycosis; 
sporotrichosis; paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma and 
chromomycosis; and dermatophytosis. Rickettsial infections include Typhus fever, 
Rocky Mountain spotted fever, Q fever, and Rickettsialpox. Examples of 
mycoplasma and chlamydial infections include: mycoplasma pneumoniae; 

20 lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections. 
Pathogenic eukaryotes encompass pathogenic protozoans and helminths and 
infections produced thereby include: amebiasis; malaria; leishmaniasis; 
trypanosomiasis; toxoplasmosis; Pneumocystis carinii; Trichans; Toxoplasma gondii; 
babesiosis; giardiasis; trichinosis; filariasis; schistosomiasis; nematodes; trematodes 

25 or flukes; and cestode (tapeworm) infections. 

Many of these organisms and/or toxins produced thereby have been 
identified by the Centers for Disease Control [(CDC), Department of Heath and 
Human Services, USA], as agents which have potential for use in biological attacks. 
For example, some of these biological agents, include, Bacillus anthracis (anthrax), 

30 Clostridium Botulinum and its toxin (botulism), Yersinia pestis (plague), variola major 
(smallpox), Francisella tularensis (tularemia), and viral hemorrhagic fevers 
[filoviruses (e.g., Ebola, Marburg], and arenaviruses [e.g., Lassa, Machupo]), all of 
which are currently classified as Category A agents; Coxiella burnetti (Q fever); 
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Brucella species (brucellosis), Burklrolderia mallei (glanders), Burkholderia 
pseudomallei (meloidosis), Ricinus communis and its toxin (ricin toxin), Clostridium 
perfringens and its toxin (epsilon toxin), Staphylococcus species and their toxins 
(enterotoxin B), Chlamydia psittaci (psittacosis), water safety threats (e.g., Vibrio 
5 cholerae, Cryptosporidium parvum), Typhus fever {Rickettsia powazekii), and viral 
encephalitis (alphaviruses, e.g., Venezuelan equine encephalitis; eastern equine 
encephalitis; western equine encephalitis); all of which are currently classified as 
Category B agents; and Nipan virus and hantaviruses, which are currently classified 
as Category C agents. In addition, other organisms, which are so classified or 

10 differently classified, may be identified and/or used for such a purpose in the future. 
It will be readily understood that the viral vectors and other constructs described 
herein are useful to deliver antigens from these organisms, viruses, their toxins or 
other by-products, which will prevent and/or treat infection or other adverse reactions 
with these biological agents. 

1 5 Administration of the vectors of the invention to deliver immunogens 

against the variable region of the T cells elicit an immune response including CTLs to 
eliminate those T cells. In RA, several specific variable regions of TCRs which are 
involved in the disease have been characterized. These TCRs include V-3, V-14, 
V-17 and Va-17. Thus, delivery of a nucleic acid sequence that encodes at least one 

20 of these polypeptides will elicit an immune response that will target T cells involved 
in RA. In MS, several specific variable regions of TCRs which are involved in the 
disease have been characterized. These TCRs include V-7 and Va- 10. Thus, delivery 
of a nucleic acid sequence that encodes at least one of these polypeptides will elicit an 
immune response that will target T cells involved in MS. In scleroderma, several 

25 specific variable regions of TCRs which are involved in the disease have been 

characterized. These TCRs include V-6, V-8, V-14 and Va-16, Va-3C, Va-7, Va-14, 
Va-15, Va-16, Va-28 and Va-12. Thus, delivery of a chimeric adenovirus that 
encodes at least one of these polypeptides will elicit an immune response that will 
target T cells involved in scleroderma. 

30 C. Ad-Mediated Delivery Methods 

The therapeutic levels, or levels of immunity, of the selected gene can 
be monitored to determine the need, if any, for boosters. Following an assessment of 
CD8+ T cell response, or optionally, antibody titers, in the serum, optional booster 
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immunizations may be desired. Optionally, the adenoviral vectors of the invention 
may be delivered in a single administration or in various combination regimens, e.g., 
in combination with a regimen or course of treatment involving other active 
ingredients or in a prime-boost regimen. A variety of such regimens have been 
5 described in the art and may be readily selected. 

For example, prime-boost regimens may involve the administration of 
a DNA (e.g., plasmid) based vector to prime the immune system to a second or 
further, booster, administration with a traditional antigen, such as a protein or a 
recombinant virus carrying the sequences encoding such an antigen. See, e.g., WO 

10 00/1 1 140, published March 2, 2000, incorporated by reference. Alternatively, an 

immunization regimen may involve the administration of a chimeric adenoviral vector 
of the invention to boost the immune response to a vector (either viral or DNA-based) 
carrying an antigen, or a protein. In still another alternative, an immunization 
regimen involves administration of a protein followed by booster with a vector 

1 5 encoding the antigen. 

In one embodiment, the invention provides a method of priming and 
boosting an immune response to a selected antigen by delivering a plasmid DNA 
vector carrying said antigen, followed by boosting with an adenoviral vector of the 
invention. In one embodiment, the prime-boost regimen involves the expression of 

20 multiproteins from the prime and/or the boost vehicle. See, e.g. , R.R. Amara, 
Science, 292:69-7 r 4 (6 April 2001) which describes a multiprotein regimen for 
expression of protein subunits useful for generating an immune response against HIV 
and SIV. For example, a DNA prime may deliver the Gag, Pol, Vif, VPX and Vpr 
and Env, Tat, and Rev from a single transcript. Alternatively, the SIV Gag, Pol and 

25 HIV-1 Env is delivered in a recombinant adenovirus construct of the invention. Still 
other regimens are described in WO 99/16884 and WO 01/54719. 

However, the prime-boost regimens are not limited to immunization 
for HTV or to delivery of these antigens. For example, priming may involve 
delivering with a first vector of the invention followed by boosting with a second 

30 vector, or with a composition containing the antigen itself in protein form. In one 
example, the prime-boost regimen can provide a protective immune response to the 
virus, bacteria or other organism from which the antigen is derived. In another 
desired embodiment, the prime-boost regimen provides a therapeutic effect that can 
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be measured using convention assays for detection of the presence of the condition 
for which therapy is being administered. 

The priming composition may be administered at various sites in the 
body in a dose dependent manner, which depends on the antigen to which the desired 
5 immune response is being targeted. The invention is not limited to the amount or 
situs of injection(s) or to the pharmaceutical carrier. Rather, the regimen may 
involve a priming and/or boosting step, each of which may include a single dose or 
dosage that is administered hourly, daily, weekly or monthly, or yearly. As an 
example, the mammals may receive one or two doses containing between about 10 \ig 

10 to about 50 \ig of plasmid in carrier. A desirable amount of a DNA composition 

ranges between about 1 ng to about 10,000 \xg of the DNA vector. Dosages may vary 
from about 1 \xg to 1000 ng DNA per kg of subject body weight. The amount or site 
of delivery is desirably selected based upon the identity and condition of the mammal. 
The dosage unit of the vector suitable for delivery of the antigen to the mammal is 

1 5 described herein. The vector is prepared for administration by being suspended or 
dissolved in a pharmaceutically or physiologically acceptable carrier such as isotonic 
saline; isotonic salts solution or other formulations that will be apparent to those 
skilled in such administration. The appropriate carrier will be evident to those skilled 
in the art and will depend in large part upon the route of administration. The 

20 compositions of the invention may be administered to a mammal according to the 
routes described above, in a sustained release formulation using a biodegradable 
biocompatible polymer, or by on-site delivery using micelles, gels and liposomes. 
Optionally, the priming step of this invention also includes administering with the 
priming composition, a suitable amount of an adjuvant, such as are defined herein. 

25 Preferably, a boosting composition is administered about 2 to about 

27 weeks after administering the priming composition to the mammalian subject. The 
administration of the boosting composition is accomplished using an effective amount 
of a boosting composition containing or capable of delivering the same antigen as 
administered by the priming DNA vaccine. The boosting composition may be 

30 composed of a recombinant viral vector derived from the same viral source (e.g., 
adenoviral sequences of the invention) or from another source. Alternatively, the 
"boosting composition" can be a composition containing the same antigen as encoded 
in the priming DNA vaccine, but in the form of a protein or peptide, which 
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composition induces an immune response in the host. In another embodiment, the 
boosting composition contains a DNA sequence encoding the antigen under the 
control of a regulatory sequence directing its expression in a mammalian cell, e.g., 
vectors such as well-known bacterial or viral vectors. The primary requirements of 
5 the boosting composition are that the antigen of the composition is the same antigen, 
or a cross-reactive antigen, as that encoded by the priming composition. 

In another embodiment, the adenoviral vectors of the invention are 
also well suited for use in a variety of other immunization and therapeutic regimens. 
Such regimens may involve delivery of adenoviral vectors of the invention 

1 0 simultaneously or sequentially with Ad vectors of different serotype capsids, 

regimens in which adenoviral vectors of the invention are delivered simultaneously or 
sequentially with non-Ad vectors, regimens in which the adenoviral vectors of the 
invention are delivered simultaneously or sequentially with proteins, peptides, and/or 
other biologically useful therapeutic or immunogenic compounds. Such uses will be 

1 5 readily apparent to one of skill in the art. 

IV. Simian Adenovirus 1 8 Sequences 

The invention provides nucleic acid sequences and amino acid sequences of 
Ad SA18, which are isolated from the other viral material with which they are 

20 associated in nature. These sequences are useful in preparing heterologous molecules 
containing the nucleic acid sequences and amino acid sequences, and regions or 
fragments thereof as are described herein, viral vectors which are useful for a variety 
of purposes, including the constructs and compositions, and such methods as are 
described herein for the chimeric adenoviruses, including, e.g., in host cells for 

25 production of viruses requiring adenoviral helper functions, as delivery vehicles for 
heterologous molecules such as those described herein. These sequences are also 
useful in generating the chimeric adenoviruses of the invention. 
A. Nucleic Acid Sequences 

The SA18 nucleic acid sequences of the invention include nucleotides 

30 SEQ ID NO: 12, nt 1 to 31967. See, Sequence Listing, which is incorporated by 

reference herein. The nucleic acid sequences of the invention further encompass the 
strand which is complementary to the sequences of SEQ ID NO: 12, as well as the 
RNA and cDNA sequences corresponding to the sequences of these sequences figures 
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and their complementary strands. Further included in this invention are nucleic acid 
sequences which are greater than 95 to 98%, and more preferably about 99 to 99.9% 
homologous or identical to the Sequence Listing. Also included in the nucleic acid 
sequences of the invention are natural variants and engineered modifications of the 
5 sequences provided in SEQ ID NO: 12 and their complementary strands. Such 

modifications include, for example, labels that are known in the art, methylation, and 
substitution of one or more of the naturally occurring nucleotides with a degenerate 
nucleotide. 

The invention further encompasses fragments of the sequences of 
10 SA1 8, their complementary strand, cDNA and RNA complementary thereto. Suitable 
fragments are at least 15 nucleotides in length, and encompass functional fragments, 
i.e., fragments which are of biological interest. For example, a functional fragment 
can express a desired adenoviral product or may be useful in production of 
recombinant viral vectors. Such fragments include the gene sequences and fragments 
1 5 listed in the tables below. 

The following tables provide the transcript regions and open reading 
frames in the simian adenovirus sequences of the invention. For certain genes, the 
transcripts and open reading frames (ORFs) are located on the strand complementary 
to that presented in SEQ ID NO: 12. See, e.g., E2b, E4 and E2a. The calculated 
20 molecular weights of the encoded proteins are also shown. 
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The SA18 adenoviral nucleic acid sequences are useful as therapeutic and 
immunogenic agents and in construction of a variety of vector systems and host cells. 
5 Such vectors are useful for any of the purposes described above for the chimeric 
adenovirus. Additionally, these SA18 sequences and products may be used alone or 
in combination with other adenoviral sequences or fragments, or in combination with 
elements from other adenoviral or non-adenoviral sequences. The adenoviral 
sequences of the invention are also useful as antisense delivery vectors, gene therapy 
10 vectors, or vaccine vectors, and in methods of using same. Thus, the invention further 
provides nucleic acid molecules, gene delivery vectors, and host cells which contain 
the Ad sequences of the invention. 

For example, the invention encompasses a nucleic acid molecule containing 
simian Ad ITR sequences of the invention. In another example, the invention 
1 5 provides a nucleic acid molecule containing simian Ad sequences of the invention 
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encoding a desired Ad gene product. Still other nucleic acid molecule constructed 
using the sequences of the invention will be readily apparent to one of skill in the art, 
in view of the information provided herein. 

In one embodiment, the simian Ad gene regions identified herein may be used 
5 in a variety of vectors for delivery of a heterologous molecule to a cell. Examples of 
such molecules and methods of delivery are provided in Section HI herein. For 
example, vectors are generated for expression of an adenoviral capsid protein (or 
fragment thereof) for purposes of generating a viral vector in a packaging host cell. 
Such vectors may be designed for expression in trans. Alternatively, such vectors are 

10 designed to provide cells which stably contain sequences which express desired 
adenoviral functions, e.g., one or more of El a, Elb, the terminal repeat sequences, 
E2a, E2b, E4, E40RF6 region. 

In addition, the adenoviral gene sequences and fragments thereof are useful 
for providing the helper functions necessary for production of helper-dependent 

15 viruses (e.g., adenoviral vectors deleted of essential functions or adeno-associated 
viruses (AAV)). For such production methods, the simian adenoviral sequences of 
the invention are utilized in such a method in a manner similar to those described for 
the human Ad. However, due to the differences in sequences between the simian 
adenoviral sequences of the invention and those of human Ad, the use of the 

20 sequences of the invention essentially eliminate the possibility of homologous 

recombination with helper functions in a host cell carrying human Ad El functions, 
e.g., 293 cells, which may produce infectious adenoviral contaminants during rAAV 
production. 

Methods of producing rAAV using adenoviral helper functions have been 
25 described at length in the literature with human adenoviral serotypes. See, e.g., US 
Patent 6,258,595 and the references cited therein. See, also, US Patent 5,871,982; 
WO 99/14354; WO 99/15685; WO 99/47691 . These methods may also be used in 
production of non-human serotype AAV, including non-human primate AAV 
serotypes. The simian adenoviral gene sequences of the invention which provide the 
30 necessary helper functions (e.g., Ela, Elb, E2a and/or E4 ORF6) can be particularly 
useful in providing the necessary adenoviral function while minimizing or eliminating 
the possibility of recombination with any other adenoviruses present in the rAAV- 
packaging cell which are typically of human origin. Thus, selected genes or open 
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reading frames of the adenoviral sequences of the invention may be utilized in these 
rAAV production methods. 

Alternatively, recombinant adenoviral simian vectors of the invention may be 
utilized in these methods. Such recombinant adenoviral simian vectors may include, 
5 e.g. , a hybrid simian Ad/AAV in which simian Ad sequences flank a rAAV 

expression cassette composed of, e.g., AAV 3 5 and/or 5 5 ITRs and a transgene under 
the control of regulatory sequences which control its expression. One of skill in the 
art will recognize that still other simian adenoviral vectors and/or gene sequences of 
the invention will be useful for production of rAAV and other viruses dependent upon 

1 0 adenoviral helper. 

In still another embodiment, nucleic acid molecules are designed for 
delivery and expression of selected adenoviral gene products in a host cell to achieve 
a desired physiologic effect. For example, a nucleic acid molecule containing 
sequences encoding an adenovirus El a protein of the invention may be delivered to a 

1 5 subject for use as a cancer therapeutic. Optionally, such a molecule is formulated in a 
lipid-based carrier and preferentially targets cancer cells. Such a formulation may be 
combined with other cancer therapeutics {e.g., cisplatin, taxol, or the like). Still other 
uses for the adenoviral sequences provided herein will be readily apparent to one of 
skill in the art. 

20 In addition, one of skill in the art will readily understand that the Ad 

sequences of the invention can be readily adapted for use for a variety of viral and 
non-viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and 
immunogenic molecules, including any of those identified as being deliverable via the 
chimeric adenoviruses of the invention. For example, the simian Ad genome of the 

25 invention can be utilized in a variety of rAd and non-rAd vector systems. Such 
vectors systems may include, e.g., plasmids, lentiviruses, retroviruses, poxviruses, 
vaccinia viruses, and adeno-associated viral systems, among others. Selection of these 
vector systems is not a limitation of the present invention. 

The invention further provides molecules useful for production of the 

30 simian and simian-derived proteins of the invention. Such molecules which carry 
polynucleotides including the simian Ad DNA sequences of the invention can be in 
the form of a vector. 
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B. Simian Adenoviral Proteins of the Invention 

The invention further provides gene products of the above 
adenoviruses, such as proteins, enzymes, and fragments thereof, which are encoded 
by the adenoviral nucleic acids of the invention. The invention further encompasses 
5 SA1 8 proteins, enzymes, and fragments thereof, having the amino acid sequences 
encoded by these nucleic acid sequences which are generated by other methods. Such 
proteins include those encoded by the open reading frames identified in the tables 
above, and fragments thereof. 

Thus, in one aspect, the invention provides unique simian adenoviral 
10 proteins which are substantially pure, i.e., are free of other viral and proteinaceous 
proteins. Preferably, these proteins are at least 10% homogeneous, more preferably 
60% homogeneous, and most preferably 95% homogeneous. 

In one embodiment, the invention provides unique simian-derived 
capsid proteins. As used herein, a simian-derived capsid protein includes any 
15 adenoviral capsid protein that contains a SA18 capsid protein or a fragment thereof, 
as defined above, including, without limitation, chimeric capsid proteins, fusion 
proteins, artificial capsid proteins, synthetic capsid proteins, and recombinantly capsid 
proteins, without limitation to means of generating these proteins. 

Suitably, these simian-derived capsid proteins contain one or more 
20 SA18 regions or fragments thereof (e.g., a hexon, penton, fiber or fragment thereof) in 
combination with capsid regions or fragments thereof of different adenoviral 
serotypes, or modified simian capsid proteins or fragments, as described herein. A 
"modification of a capsid protein associated with altered tropism" as used herein 
includes an altered capsid protein, i.e, a penton, hexon or fiber protein region, or 
25 fragment thereof, such as the knob domain of the fiber region, or a polynucleotide 
encoding same, such that specificity is altered. The simian-derived capsid may be 
constructed with one or more of the simian Ad of the invention or another Ad 
serotypes which may be of human or non-human origin. Such Ad may be obtained 
from a variety of sources including the ATCC, commercial and academic sources, or 
30 the sequences of the Ad may be obtained from GenBank or other suitable sources. 

The amino acid sequences of the simian adenoviruses penton proteins 
of the invention are provided herein. The AdSAl 8 penton protein is provided in SEQ 
ID NO: 13. Suitably, any of these penton proteins, or unique fragments thereof, may 
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be utilized for a variety of purposes. Examples of suitable fragments include the 
penton having N-terminal and/or C-terminal truncations of about 50, 100, 150, or 200 
amino acids, based upon the amino acid numbering provided above. Other suitable 
fragments include shorter internal, C-terminal, or N-terminal fragments. Further, the 
5 penton protein may be modified for a variety of purposes known to those of skill in 
the art. 

The invention further provides the amino acid sequences of the hexon 
protein of SA1 8, SEQ ID NO: 14. Suitably, this hexon protein, or unique fragments 
thereof, may be utilized for a variety of purposes. Examples of suitable fragments 

10 include the hexon having N-terminal and/or C-terminal truncations of about 50, 100, 
150, 200, 300, 400, or 500 amino acids, based upon the amino acid numbering 
provided above and in SEQ ID NO: 14. Other suitable fragments include shorter 
internal, C-terminal, or N-terminal fragments. For example, one suitable fragment the 
loop region (domain) of the hexon protein, designated DEI and FG1, or a 

1 5 hypervariable region thereof. Such fragments include the regions spanning amino 
acid residues about 125 to 443; about 138 to 441, or smaller fragments, such as those 
spanning about residue 138 to residue 163; about 170 to about 176; about 195 to 
about 203; about 233 to about 246; about 253 to about 264; about 287 to about 297; 
about 404 to about 430, about 430 to 550, about 545 to 650; of the simian hexon 

20 proteins, with reference to SEQ ID NO: 14. Other suitable fragments may be readily 
identified by one of skill in the art Further, the hexon protein may be modified for a 
variety of purposes known to those of skill in the art. Because the hexon protein is 
the determinant for serotype of an adenovirus, such artificial hexon proteins would 
result in adenoviruses having artificial serotypes. Other artificial capsid proteins can 

25 also be constructed using the chimp Ad penton sequences and/or fiber sequences of 
the invention and/or fragments thereof. 

In one example, it may be desirable to generate an adenovirus having 
an altered hexon protein utilizing the sequences of a hexon protein of the invention. 
One suitable method for altering hexon proteins is described in US Patent 5,922,315, 

30 which is incorporated by reference. In this method, at least one loop region of the 
adenovirus hexon is changed with at least one loop region of another adenovirus 
serotype. Thus, at least one loop region of such an altered adenovirus hexon protein 
is a simian Ad hexon loop region of the invention. In one embodiment, a loop region 
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of the SA18 hexon protein is replaced by a loop region from another adenovirus 
serotype. In another embodiment, the loop region of the SA18 hexon is used to 
replace a loop region from another adenovirus serotype. Suitable adenovirus 
serotypes may be readily selected from among human and non-human serotypes, as 
5 described herein. SA1 8 is selected for purposes of illustration only; the other simian 
Ad hexon proteins of the invention may be similarly altered, or used to alter another 
Ad hexon. The selection of a suitable serotype is not a limitation of the present 
invention. Still other uses for the hexon protein sequences of the invention will be 
readily apparent to those of skill in the art. 
10 The invention further encompasses the fiber proteins of the simian 

adenoviruses of the invention. The fiber protein of AdSAl 8 has the amino acid 
sequence of SEQ ID NO: 15. Suitably, this fiber protein, or unique fragments thereof, 
may be utilized for a variety of purposes. One suitable fragment is the fiber knob, 
which spans about amino acids 247 to 425 of SEQ ID NO: 15. Examples of other 

1 5 suitable fragments include the fiber having N-terminal and/or C-terminal truncations 
of about 50, 100, 150, or 200 amino acids, based upon the amino acid numbering 
provided above and in SEQ ID NO: 15. Still other suitable fragments include internal 
fragments. Further, the fiber protein may be modified using a variety of techniques 
known to those of skill in the art. 

20 The invention further encompasses unique fragments of the proteins 

of the invention which are at least 8 amino acids in length. However, fragments of 
other desired lengths can be readily utilized. In addition, the invention encompasses 
such modifications as may be introduced to enhance yield and/or expression of an 
SA18 gene product, e.g., construction of a fusion molecule in which all or a fragment 

25 of the SA1 8 gene product is fused (either directly or via a linker) with a fusion 
partner to enhance. Other suitable modifications include, without limitation, 
truncation of a coding region (e.g., a protein or enzyme) to eliminate a pre- or pro- 
protein ordinarily cleaved and to provide the mature protein or enzyme and/or 
mutation of a coding region to provide a secretable gene product. Still other 

30 modifications will be readily apparent to one of skill in the art. The invention further 
encompasses proteins having at least about 95% to 99% identity to the SA1 8 proteins 
provided herein. 
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As described herein, vectors of the invention containing the adenoviral capsid 
proteins of the invention are particularly well suited for use in applications in which 
the neutralizing antibodies diminish the effectiveness of other Ad serotype based 
vectors, as well as other viral vectors. The rAd vectors of the invention are 
5 particularly advantageous in readministration for repeat gene therapy or for boosting 
immune response (vaccine titers). Examples of such regimens are provided herein. 

Under certain circumstances, it may be desirable to use one or more of 
the SA18 gene products (e.g., a capsid protein or a fragment thereof) to generate an 
antibody. The term "an antibody, 11 as used herein, refers to an immunoglobulin 

10 molecule which is able to specifically bind to an epitope. Thus, the antibodies of the 
invention bind, preferably specifically and without cross-reactivity, to a SA18 
epitope. The antibodies in the present invention exist in a variety of forms including, 
for example, high affinity polyclonal antibodies, monoclonal antibodies, synthetic 
antibodies, chimeric antibodies, recombinant antibodies and humanized antibodies. 

15 Such antibodies originate from immunoglobulin classes IgG, IgM, IgA, IgD and IgE. 

Such antibodies may be generated using any of a number of methods 
know in the art. Suitable antibodies may be generated by well-known conventional 
techniques, e.g. Kohler and Milstein and the many known modifications thereof. 
Similarly desirable high titer antibodies are generated by applying known 

20 recombinant techniques to the monoclonal or polyclonal antibodies developed to 
these antigens [see, e.g., PCT Patent Application No. PCT/GB85/00392; British 
Patent Application Publication No. GB2188638A; Amit et al, 1986 Science, 
233:747-753; Queen etal, 1989 Proc. Natl Acad Sci. USA, 56:10029-10033; PCT 
Patent Application No. PCT/WO9007861; and Riechmann et al, Nature, 332:323- 

25 327 (1988); Huse et al, 1988a Science, 246:1275-1281]. Alternatively, antibodies can 
be produced by manipulating the complementarity determining regions of animal or 
human antibodies to the antigen of this invention. See, e.g., E. Mark and Padlin, 
"Humanization of Monoclonal Antibodies", Chapter 4, The Handbook of 
Experimental Pharmacology, Vol. 113, The Pharmacology of Monoclonal Antibodies, 

30 Springer- Verlag (June, 1994); Harlow et al, 1999, Using Antibodies: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press, NY; Harlow et al , 1 989, Antibodies: 
A Laboratory Manual, Cold Spring Harbor, New York; Houston et al, 1988, Proc. 
Natl Acad. Sci. USA 55:5879-5883; and Bird et al, 1988, Science 242:423426. 
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Further provided by the present invention are anti-idiotype antibodies (Ab2) and anti- 
anti-idiotype antibodies (Ab3). See, e.g., M. Wettendorff et al., "Modulation of anti- 
tumor immunity by anti-idiotypic antibodies." In Idiotypic Network and Diseases, ed. 
by J. Cerny and J. Hiernaux, 1990 J. Am. Soc. Microbiol., Washington DC: pp. 203- 
5 229]. These anti-idiotype and anti-anti-idiotype antibodies are produced using 

techniques well known to those of skill in the art. These antibodies may be used for a 
variety of purposes, including diagnostic and clinical methods and kits. 

Under certain circumstances, it may be desirable to introduce a 
detectable label or a tag onto a SA18 gene product, antibody or other construct of the 
10 invention. As used herein, a detectable label is a molecule which is capable, alone or 
upon interaction with another molecule, of providing a detectable signal. Most 
desirably, the label is detectable visually, e.g. by fluorescence, for ready use in 
immunohistochemical analyses or immunofluorescent microscopy. For example, 
suitable labels include fluorescein isothiocyanate (FITC), phycoerythrin (PE), 
1 5 allophycocyanin (APC), coriphosphine-0 (CPO) or tandem dyes, PE-cyanin-5 (PC5), 
and PE-Texas Red (ECD). All of these fluorescent dyes are commercially available, 
and their uses known to the art. Other useful labels include a colloidal gold label. 
Still other useful labels include radioactive compounds or elements. Additionally, 
labels include a variety of enzyme systems that operate to reveal a colorimetric signal 
20 in an assay, e.g. , glucose oxidase (which uses glucose as a substrate) releases 

peroxide as a product which in the presence of peroxidase and a hydrogen donor such 
as tetramethyl benzidine (TMB) produces an oxidized TMB that is seen as a blue 
color. Other examples include horseradish peroxidase (HRP) or alkaline phosphatase 
(AP), and hexokinase in conjunction with glucose-6-phosphate dehydrogenase which 
25 reacts with ATP, glucose, and NAD+ to yield, among other products, NADH that is 
detected as increased absorbance at 340 nm wavelength. 

Other label systems that are utilized in the methods of this invention 
are detectable by other means, e.g., colored latex microparticles [Bangs Laboratories, 
Indiana] in which a dye is embedded are used in place of enzymes to form conjugates 
30 with the target sequences provide a visual signal indicative of the presence of the 
resulting complex in applicable assays. 

Methods for coupling or associating the label with a desired molecule 
are similarly conventional and known to those of skill in the art Known methods of 



47 



WO 2005/001103 



PCT/US2004/016614 



label attachment are described [see, for example, Handbook of Fluorescent probes 
and Research Chemicals, 6th Ed., R. P. M. Haugland, Molecular Probes, Inc., 
Eugene, OR, 1996; Pierce Catalog and Handbook, Life Science and Analytical 
Research Products, Pierce Chemical Company, Rockford, IL, 1994/1995]. Thus, 
5 selection of the label and coupling methods do not limit this invention. 

The sequences, proteins, and fragments of the invention may be 
produced by any suitable means, including recombinant production, chemical 
synthesis, or other synthetic means. Suitable production techniques are well known 
tothoseofskillintheart. See, e.g., Sambrook et al, Molecular Cloning: A 
1 0 Laboratory Manual, Cold Spring Harbor Press (Cold Spring Harbor, NY). 

Alternatively, peptides can also be synthesized by the well known solid phase peptide 
synthesis methods (Merrifield, J. Am. Chem. Soc, 85:2149 (1962); Stewart and 
Young, Solid Phase Peptide Synthesis (Freeman, San Francisco, 1969) pp. 27-62). 
These and other suitable production methods are within the knowledge of those of 
15 skill in the art and are not a limitation of the present invention. 

In addition, one of skill in the art will readily understand that the Ad 
sequences of the invention can be readily adapted for use for a variety of viral and 
non-viral vector systems for in vitro, ex vivo or in vivo delivery of therapeutic and 
immunogenic molecules. For example, in one embodiment, the simian Ad capsid 
20 proteins and other simian adenovirus proteins described herein are used for non-viral, 
protein-based delivery of genes, proteins, and other desirable diagnostic, therapeutic 
and immunogenic molecules. In one such embodiment, a protein of the invention is 
linked, directly or indirectly, to a molecule for targeting to cells with a receptor for 
adenoviruses. Preferably, a capsid protein such as a hexon, penton, fiber or a fragment 
t5 thereof having a ligand for a cell surface receptor is selected for such targeting. 
Suitable molecules for delivery are selected from among the therapeutic molecules 
described herein and their gene products. A variety of linkers including, lipids, 
polyLys, and the like may be utilized as linkers. For example, the simian penton 
protein may be readily utilized for such a purpose by production of a fusion protein 
0 using the simian penton sequences in a manner analogous to that described in 

Medina-Kauwe LK, et al, Gene Ther. 2001 May; 8(10):795-803 and Medina-Kauwe 
LK, et al, Gene Ther. 2001 Dec; 8(23): 1753-1761. Alternatively, the amino acid 
sequences of simian Ad protein IX may be utilized for targeting vectors to a cell 



48 



WO 2005/001103 



PCT/US2004/016614 



10 



15 



20 



25 



30 



surface receptor, as described in US Patent Appln 20010047081. Suitable iigands 
include a CD40 antigen, an RGD-containing or polylysine-containing sequence, and 
the like. Still other simian Ad proteins, including, e.g., the hexon protein and/or the 
fiber protein, may be used for used for these and similar purposes. 

Still other adenoviral proteins of the invention may be used as alone, 
or in combination with other adenoviral protein, for a variety of purposes which will' 
be readily apparent to one of skill in the art. In addition, still other uses for the 
adenoviral proteins of the invention will be readily apparent to one of skill in the art 

The compositions of this invention include vectors that deliver a 
heterologous molecule to cells, either for therapeutic or vaccine purposes. Such 
vectors, containing simian adenovirus DNA of SA18 and a minigene, can be 
constructed using techniques such as those described herein for the chimeric 
adenoviruses and such techniques as are known in the art. Alternatively, SA19 may 
be a source for sequences of the chimeric adenoviruses are described herein. 

The Mowing examples illustrate construction and use of several 
chimeric viruses, including Pan5/Cl, hu5/Pan7 and hu5/SV25, and Pan6/Pan7. 
However, these chimera are illustrative only and are not intended to limit the 
invention to those illustrated embodiments. 

Example 1 - Construction of Pan5/C1 Chimeric Simian Viruses 

Five different adenoviruses initially isolated from the chimpanzee, AdC68 
[US Patent 6,083,716], AdPan5, AdPan7, AdPan6 and AdCl [US Patent 6,083,716] 
have been sequenced. See, International Application No. PCT/US02/33645, filed 
November 2002 for the sequences of Pan5 [SEQ ID NO: 1], Pan7 [SEQ ID NO:3], 
and Pan6 [SEQ ID NO:2]. This application also provides sequences for SV1, SV25 
and SV39 [SEQ ID No. 4, 5, 6, respectively]. Sequence comparison of the capsid 
protein sequences predicted that AdCl clearly belonged to a different serological 
subgroup than the other four chimpanzee derived adenoviruses. 

However, attempts to cultivate AdCl in HEK293 cells revealed it to be 
fastidious in its growth characteristics (data not shown) and therefore possibly 
unsuitable for use as a vector using the currently available El complementing cell 
lines. However, because of the obvious sequence dissimilarity of AdCl capsid 
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protein sequence from the other chimpanzee derived adenoviruses (as well as the 
huAd5), chimeric adenovirus vectors were generated with the capsid characteristics of 
AdCl. In view of the above-mentioned drawbacks associated with only making 
hexon changes, more extensive replacements were made in the chimera described 
5 herein, i. e. , construction of chimeras where the replacement went beyond just the 
hexon, to achieve two goals. The first was to determine whether making extended 
replacements would allow for the rescue of viruses containing hexons of unrelated 
serotypes that may not otherwise be amenable to rescue. The second goal was to test 
whether the growth characteristics of adenovirus vectors such as AdPan5, that have 
1 0 been found in our laboratory to be able to be grown to high titer for the purpose of 
manufacture, would also be present in the chimeric virus, particularly when the hexon 
(and other capsid proteins) are derived from a virus such as AdCl that are difficult to 
grow to a high yield in cell lines such as HEK293. An added bonus of extending the 
replacement to include the fiber protein would be to further increase the antigenic 
1 5 dissimilarity to beyond that afforded by a hexon change alone. 

As an alternative to obtaining purified virus as source for adenoviral DNA to 
sequence, we have resorted to cloning restriction fragments of viral DNA obtained 
from infected cells ("Hirt prep"). The first adenovirus we have sequenced in this way 
is Simian Adenovirus. EcoRI digestion of the Simian Adenovirus yielded 7 
20 fragments. Shotgun cloning yielded clones of the 5 internal fragments, which were 
cloned and sequenced. Completion of the sequencing was carried out by walking 
towards each of the ends of the genome. The map of the genome is shown in Figure 
1. 

A. Construction of Two Pan5/Cl Chimeric Plasmids 
25 The overall approach towards constructing chimeric viruses was to 

first assemble the complete El deleted virus DNA into a single plasmid flanked by 
recognition sites for the restriction enzyme Swal, digest the plasmid DNA with Swal 
to release the virus DNA ends, and transfect the DNA into HEK293 cells to determine 
whether viable chimeric adenovirus could be rescued. Two chimeric virus plasmids 
30 were constructed, p5Cl short and p5Cl long. 

The plasmid p5Clshort harbors an El deleted Pan5 virus where an 
internal 15226 bp segment (18332 - 33557) has been replaced by a functionally 
analogous 14127 bp (18531 - 32657) from AdCl. This results in the replacement of 
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the Pan5 proteins hexon, endoprotease, DNA binding protein, 100 kD scaffolding 
protein, 33 kD protein, protein VIII, and fiber, as well as the entire E3 region, with 
the homologous segment from AdCl . The Clal site at the left end of the AdCl 
fragment is at the beginning of the hexon gene and the resulting protein is identical to 
5 the CI hexon. The EcoRI site which constitutes the right end of the AdCl fragment 
is within the E4 orf 7 part of the AdCl . The right end was ligated to a PGR generated 
right end fragment from AdPan5 such that the regenerated orf 7-translation product is 
chimeric between AdPanS and AdCl . 

The plasmid p5Cllong harbors an El deleted Pan5 virus where an 

10 internal 25603 bp segment (7955 - 33557) has been replaced by a functionally 

analogous 24712 bp (7946 - 32657) from AdCl. This results in the replacement of 
the AdPanS pre-terminal protein, 52/55 kD protein, penton base protein, protein VII, 
Mu, and protein VI with those from AdCl in addition to those replaced in p5Cl short. 
The AscI site at the left end of the AdCl fragment is at the beginning of the DNA 

15 polymerase gene and results in a chimeric protein where the first 165 amino acids of 
the AdPan5 DNA polymerase has been replaced by a 167 amino acid segment from 
AdCl DNA polymerase. In this N-terminal region, the homology between the 
AdPanS and AdCl DNA polymerase proteins is 81% (72% identity). 

The plasmid pDVP5Mlu which contains the left end of AdPan5 was 

20 used as the starting plasmid for the chimeric vector construction. 

The plasmid pDVP5Mlu was made as follows. A synthetic DNA 
fragment harboring recognition sites for the restriction enzymes Smal, MM, EcoRI 
and EcoRV respectively was ligated into pBR322 digested with EcoRI and Ndel so as 
to retain the origin of replication and the beta-lactamase gene. The left end of Pan5 

25 extending to the Mlul site (15135 bp) was cloned into this plasmid between the Smal 
and Mlul sites. The El gene was functionally deleted and replaced by a DNA 
fragment harboring recognition sites for the extremely rare cutter restriction enzyme 
sites I-Ceul and Pl-Scel). The 2904 base pairs of the right end of Pan-5 was PCR 
amplified using the primers P5L [GCG CAC GCG TCT CTA TCG ATG AAT TCC 

30 ATT GGT GAT GGA CAT GC, SEQ ED NO:7] and P5ITR [GCG CAT TTA AAT 
CAT CAT CAA TAA TAT ACC TCA AAC, SEQ ID NO:8] using Tgo polymerase 
(Roche). The PCR product was cut with Mlul and Swal, and cloned between Mlul 
and EcoRV of pDVPSMlu to yield pPan5Mlu+RE. A 3 193 bp fragment extending 
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from the Mlul site (1 5135) to the Clal (18328) site of Pan5 was then inserted between 
the same sites of pPan5MIu+RE to yield pPan5Cla+RE. The 3671 bp Clal (18531) to 
EcoRI (22202) fragment of the adenovirus CI was cloned into pPan5Cla+RE between 
Clal (161 1 1) and EcoRI (161 16) to yield pPan5CldelRL The 10452 bp internal 
5 EcoRI fragment of the adenovirus CI (22202 - 32653) was cloned into the EcoRI site 
of pPanSCldelRI to yield p5Clshort. To construct p5Cllong, the AdCl replacement 
was further extended by replacing the AscI - Clal 10379 bp fragment of AdPanS in 
p5 CI short with the AdCl AscI - Clal 10591 bp fragment. Finally a green fluorescent 
protein (GFP) expression cassette was inserted into both p5Clshort and p5Cllong 
10 between the I-Ceul and Pl-Scel sites to yield pSClshortGFP and p5CllongGFP 
respectively. 

B. Rescue of chimeric Pan5/Cl recombinant vector adenoviruses 

The plasmids p5ClshortGFP and pSCllongGFP were digested with 
the restriction enzyme Swal and transfected into HEK 293 cells. A typical adenovirus 

15 induced cytopathic effect was observed. The rescue of recombinant chimeric 
adenovirus from the p5CllongGFP transfection was confirmed by collecting the 
supernatant from the transfection and re-infecting fresh cells which were found to be 
transduced as determined by GFP expression. Viral DNA prepared from the chimeric 
recombinant virus was digested with several restriction enzymes and found to have 

20 the expected pattern on electrophoresis (data not shown). 

The chimeric adenoviral construct with the shorter replacement 
p5Cl short encodes the CI proteins hexon and fiber as well as the intervening open 
reading frames for endoprotease, DNA binding protein, 1 00 kDa scaffolding protein, 
33 kDa protein, and protein VIII. (The E3 region is also included within this region 

25 but is unlikely to impact on the viability of the chimeric virus). When the 

replacement was extended to include the additional AdCl proteins pTP (pre-terminal 
protein), 52/55 kDa protein, penton base, protein VII, Mu, and protein VI, there was 
no difficulty in rescuing viable chimeric virus. In this experiment, the chimeric 
adenovirus construction strategy utilized the presence of AscI and Clal restriction 

30 enzyme sites present on the genes for DNA polymerase and hexon respectively on 
both AdPanS and AdCl. 

The reasons for the relatively higher yield of the chimeric virus 
compared to the wild-type AdCl virus are not clear. In the growth of the 5C1 
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chimeric virus in 293 cells, the adenoviral early region gene products of El and E4 
are derived from Ad5 and AdPanS respectively. The El and E4 gene products bind, 
regulate and de-repress several cellular transcription complexes and coordinate their 
activity towards viral multiplication. Thus it is possible that the El gene products 
5 supplied in trans from the 293 cells and the E4 gene products from AdPanS are more 
optimal in the human 293 cell background than are the equivalent AdCl gene 
products. This may also apply to the activity of the major late promoter whose 
activity is responsible for the transcription of the capsid protein genes. In the 
chimeric virus, the major late promoter, and the protein IVa2 which transactivates it, 
10 are derived from AdPanS. However the E2 gene products required for adenoviral 
DNA replication pTP and single-stranded DNA - binding protein are derived from 
AdCl. The adenoviral DNA polymerase, which complexes with pTP, is chimeric in 
AdSCl but mostly AdPan5 derived. 

15 Example 2 - Construction of Ad5 Chimeric Simian Viruses 

Plasmids have been constructed where the structural proteins derive 
from the chimpanzee adenovirus Pan 7 and the flanking sequences are derived from 
human Ad5 (the commonly used vector strain). The Adhu5-Pan7 chimeric 
adenovirus has been rescued, demonstrating that the chimeric virus construction 

20 method used to derive the chimeric virus is broadly applicable. 

A. Construction of the Ad5 - Pan 7 chimeric adenovirus 
A plasmid was constructed which harbors the complete (El 
deleted) chimeric genome in order to establish that the chimeric adenovirus is viable, 
and then transfected the plasmid into the El complementing cell line HEK 293. It 

25 was found that the recombinant virus could be rescued. The chimeric adenovirus 
genome that was constructed is composed of a left end segment derived from Ad5 
that contributes the ITR, the El deletion region containing the transgene expression 
cassette, the pDC and IVa2 genes and 954 C-terminal amino acids of the polymerase 
gene (which is transcribed in the right to left direction from the bottom strand). Ad5 

30 also contributes the right end of the chimeric genome containing the E4 genes and the 
right ITR. All the other genes present in the central part of the chimeric construct are 
derived from the chimpanzee adenovirus Pan 7 including the N-terminal 235 amino 
acids of a chimeric DNA polymerase. 
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In order to construct the plasmid which harbors the complete (El 
deleted) chimeric genome, the starting plasmid was pBRAd51ere which is comprised 
of three parts; the bacterial origin of replication and ampicillin resistance gene 
derived from the plasmid pBR322, the left end of an Ad5 derived El deleted vector 
5 extending from the left ITR to the StuI site located at base pair number 5782 of the 
wild-type Ad5 genome (the El deletion extends from base pair 342 to 3533 of the 
wild-type Ad5 genome), and the right end of Ad5 extending from the StuI site at base 
pair number 3 1954 of the wild-type Ad5 genome to the right end of the right ITR. 
The Pad sites located adjacent to the two ITRs are used to release the Ad5 genome 
10 from the bacterial plasmid backbone. The fragment containing I-Ceul and the Pl-Scel 
sites which is located in place of the El deletion is used to insert transgene cassettes. 

A synthetic DNA oligomer was inserted at the StuI site containing 
sites for AscI, Xbal and EcoRI, which allowed the creation of the plasmid 
pAd5endsAscRI where using PCR, the Ad5 polymerase gene was extended to base 
15 pair #8068 of the wild-type Ad5 genome and incorporating a newly created AscI site 
at this location by silent mutagenesis of the polymerase gene (translated from the 
bottom strand) as depicted below. 

Original sequence 

GCGACGGGCCGA [SEQIDNO:16] 
20 CGC TGC CCG GCT 

Arg Arg Ala Ser [SEQ ID NO: 1 7] 

Mutated sequence (The AscI recognition site is underlined) 
GCGGCGCGCCGA [SEQ ID NO: 1 8] 
CGC TGC CCG GCT 
25 Arg Arg Ala Ser [SEQ ID NO: 17] 

The Pan 7 fiber containing region was amplified by PCR (mutating 
the fiber stop codon from TGA to TAA to provide a polyadenylation signal similar to 
that in Ad5) and inserted into the EcoRI site to yield pAd5endsP7fib. Several cloning 
steps led to the construction of pH5C7H5 where the complete chimeric adenoviral 
30 genome has been assembled A transgene cassette expressing GFP (green fluorescent 
protein) was inserted between the I-Ceul and Pl-Scel sites of pH5C7H5. The final 
construct was digested with Pad to separate the adenoviral genome from the plasmid 
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backbone and transfected into HEK 293 cells. The cell lysate was harvested 2 weeks 
later, and the chimeric adenovirus was amplified and purified by standard methods. 

B. Construction of the Ad5 - Simian virus 25 (SV-25) chimeric 

5 adenovirus 

[N.B. Simian virus 25 (ATCC catalog number VR-201) is 
distinct from the chimpanzee adenovirus Simian adenovirus 25 ATCC catalog 
number VR-594] 

The construction of the Ad5 based chimeric adenovirus where 

10 the left and right end segments are derived from Ad5 and the central portion was 
derived from the monkey adenovirus SV-25 was carried out in a manner completely 
analogous to that described above for the chimeric adenovirus described above that is 
chimeric between Ad5 and the chimpanzee adenovirus Pan 7. Thus, the chimeric 
adenovirus genome that was constructed is composed of a left end segment derived 

15 from Ad5 that contributes the ITR, the El deletion region containing the transgene 
expression cassette, the pIX and IVa2 genes and 956 C-terminal amino acids of the 
polymerase gene. Ad5 also contributes the right end of the chimeric genome 
containing the E4 genes and the right ITR. [Additionally, the left end of the Ad5 
genome was extended beyond that present in pH5C7H5 so that 454 base pairs of the 

20 Ad5 left end was present. Although not absolutely essential, this was done in order to 
improve packaging efficiency.] All the genes present in the central part of the 
chimeric construct are derived from the monkey adenovirus SV-25 including the N- 
terminal 230 amino acids of a chimeric DNA polymerase. The starting plasmid for 
the construction of the chimeric genome was pAdSendsAscRI which contains both 

25 the left and right ends of Ad5 as well as the created (by silent mutation) AscI site in 
the polymerase gene where Ad5-SV25 chimeric fusion was made (as was done for the 
Ad5 - Pan 7 chimeric adenovirus). In the final construct pH5S25H5, the SV25 
genome segment has been incorporated by sequential cloning steps, including 
creation of an AscI site at the ligation junction within the polymerase coding 

30 sequence. A transgene cassette expressing GFP (green fluorescent protein) was 

inserted between the I-Ceul and Pl-Scel sites of pH5S25H5. The final construct was 
digested with Pad to separate the adenoviral genome from the plasmid backbone and 
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transfected into HEK 293 cells. The cell lysate was harvested 2 weeks later, and the 
chimeric adenovirus was amplified and purified by standard methods. 

Fig. 2 provides the map of the recombinant Adhu5-SV25 chimeric virus. 
The portion of the genome replaced by DNA from Pan7 is indicated. 

5 

Example 3 - Pan5 - C1 chimeric vector of invention as a delivery vehicle 

for immunogenic compositions 

A Pan 5 (Simian adenovirus 22, a subgroup E adenovirus, also termed C5) 

- CI (Simian adenovirus 21, a subgroup B adenovirus) chimeric expressing the Ebola 
10 virus (Zaire) glycoprotein (C5C1C5-CMVGP) was constructed as a model antigen in 

order to test the efficacy of the vector C5C1C5-CMVGP as a vaccine; this vector has 

been compared it to the Adhu5 based vector (H5-CMVGP). Compared to H5- 

CMVGP, the C5C1-CMVGP vector yielded only a slightly decreased level of GP 

expression in transduced A549 cells. 
15 Thereafter, GP-specific T cell and B cell responses elicited in B 1 0BR mice 

vaccinated intramuscularly with either 5 x 10 10 H5-CMVGP or C5C1-CMVGP 

vectors were compared. 

The C5C1C5-CMVGP vector appeared to induce lower frequencies of gamma 

interferon producing CD8+ T cells with kinetics slower than the H5-CMVGP vector 
20 as determined by intracellular cytokine staining using a H-2k restricted GP-specific 

peptide as stimulant. The total IgG response to GP, measured by ELISA, was 

equivalent in serum from mice vaccinated with the C5C1C5-CMVGP or the H5- 

CMVGP vectors. However, the C5C1C5-CMVGP vector induced a more potent Thl 

type response while the H5-CMVGP vector stimulated a more balanced Thl/Th2 type 
25 response. In a survival study, mice were vaccinated as above and challenged 28 days 

later with 200 LD/50 mouse-adapted Ebola Zaire virus. 100% survival was seen for 

both groups. 

Example 4 - Generation of Chimeric Pan6/Pan7 Vectors 
30 A panel of GFP expressing vectors were generated. This panel includes, 

vectors that are chimeric between Pan 6 and Pan 7 where (a) the hexon protein of Pan 
7 was replaced by that of Pan 6 (termed C767), (b), the fiber protein of Pan7 was 
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replaced by that of Pan 6 (termed C776), (c) both the hexon and fiber proteins of the 
Pan 7 vector have been replaced by those from Pan 6 (termed C766). 

The chimeric virus termed C767 was constructed essentially as described 
above for the C5C1C5 virus in Example 1. However, due to substantial homology 
5 between the Pan6 and Pan7 sequences 5' to the hexon sequence, it was not necessary 
to substitute the 5' end of the genome between the penton and the pol gene. 

The chimeric vector C767 was compared to the C776, C766, the parent C6, 
and the parent C7, each expressing GFP. 

Balb/C mice (25 per group) were immunized intramuscularly with either 
10 Pan 6 or Pan 7 (1 0 !0 particles). Re-administration (10 1 1 particles i.v., by tail vein 
injection) was attempted 3 weeks later using each of the five GFP expressing vectors 
(C6-GFP, C7-GFP, and the three chimeric vectors). Three days later the level of liver 
transduction was estimated qualitatively by examining liver sections for the presence 
of GFP expression and quantitatively by estimating copies of GFP DNA by Taqman 
1 5 analysis. Administration of either one of the two chimpanzee adenovirus vectors does 
not affect the transduction efficiency of the other vector, while re-administration of 
the same vector is severly compromised. The data showed that antibodies to both 
hexon and fiber are important in preventing re-administration of adenoviral vectors. 

20 

All publications cited in this specification are incorporated herein by 
reference, as are the priority documents, US Patent Application 60/575,429, filed 
March 28, 2004; US Patent Application No. 60/566,212, filed April 28, 2004, and US 
Patent Application No. 10/465,302, filed June 20, 2003. While the invention has been 
25 described with reference to a particularly preferred embodiment, it will be appreciated 
that modifications can be made without departing from the spirit of the invention. 
Such modifications are intended to fall within the scope of the appended claims. 
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What is claimed is: 

1. A method of efficiently culturing a chimeric adenovirus in a selected host 
cell, said chimeric adenovirus being from a parental adenovirus strain incapable of 
efficient growth in said host cell, said method comprising the steps of: 
(a) generating a chimeric adenovirus comprising: 

(i) adenovirus sequences of the left terminal end and 
right terminal end of a first adenovirus which grows in a selected host cell type, said 
left end region comprising the 5 5 inverted terminal repeat (ITRs), and said right end 
region comprising the 3' inverted terminal repeat (ITRs); and 

(ii) the internal regions from a parental adenovirus 
which lacks its native 5' and 3' terminal regions, said internal regions comprising the 
late genes encoding the penton, hexon, and fiber; 

wherein the resulting chimeric adenovirus comprises, from 5 5 
to 3\ a left terminal region of the first adenovirus, the internal region of the parental 
adenovirus, and the right terminal region of the first adenovirus; and 

b) culturing said chimeric adenovirus in the presence of functional 
adenovirus El a, Elb, and E4 ORF6 genes from the first adenovirus or from an 
adenovirus serotype which transcomplements the first adenovirus, and further in the 
presence of necessary adenoviral structural genes from the left end of the adenovirus. 

2. The method according to claim 1, wherein the internal region of the 
parental adenovirus further comprises one or more functional adenovirus genes 
selected from the group consisting of Endoprotease open reading frame, DNA 
binding protein, 100 kDa scaffolding protein, 33 kDa protein, protein Vm, pTP, 
52/55 kDa protein, protein VII, Mu and protein VI. 

3. The method according to claim 1, wherein the polymerase, terminal 
protein and 52/55 kDa protein functions are provided in trans. 
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4. The method according to claim 1, wherein the first adenovirus further 
comprises the polymerase, terminal protein and 52/55 kDa protein functions. 

5. The method according to claim 1, wherein the chimeric adenovirus 
comprises the adenoviral late genes 1, 2, 3, 4, and 5 of the parental adenovirus. 

6. The method according to claim 1, wherein the selected host cell stably 
contains one or more of the adenovirus El a, Elb or E4 ORF6 functions. 

7. The method according to claim 1, wherein the chimeric adenovirus 
comprises one or more of the adenovirus El a, Elb or E4 ORP6 of the first 
adenovirus. 

8. The method according to claim 1, wherein the first adenovirus is of 
human origin. 

9. The method according to claim 1, wherein the first adenovirus is of 
simian origin. 

10. The method according to claim 1, further comprising the step of isolating 
the chimeric adenovirus. 

1 1 . A method for generating a chimeric adenovirus for growth in a selected 
host cell, said chimeric adenovirus being derived from a parental adenovirus strain 
incapable of efficient growth in said host cell, said method comprising the step of 
generating a chimeric adenovirus comprising: 

5' and 3 5 terminal regions of a first adenovirus which grows in a 
selected host cell type, said 5' terminal regions comprising the 5 5 inverted terminal 
repeat (ITRs) and necessary El gene functions, and said 3 5 terminal regions 
comprising inverted terminal repeat (ITRs) and necessary E4 gene functions; and 

internal regions from a parental adenovirus which lacks its native 5' 
and 3' terminal regions, said internal regions comprising the hexon, penton base and 
fiber; 
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wherein the resulting chimeric adenovirus comprises, from 5' to 3\ the 5' 
terminal region of the first adenovirus, the internal region of the parental adenovirus, 
and the 3' terminal regions of the first adenovirus. 

12. A chimeric adenovirus produced according to the method of claim 1 . 

13. A chimeric adenovirus comprising a hexon protein of a selected 
adenovirus serotype which is incapable of efficient growth in a selected host cell, 
said modified adenovirus comprising: 

(a) adenovirus sequences of the left terminal end of a first 
adenovirus which grows in a selected host cell type, said left end region comprising 
the El a, El b and 5' inverted terminal repeat (ITRs); 

(b) adenovirus sequences of the internal region of the selected 
adenovirus serotype which is incapable of efficient growth in the selected host cell, 
said internal region comprising the genes encoding the penton, hexon and fiber of the 
selected adenovirus; 

(c) adenovirus sequences of the right terminal end of the first 
adenovirus, said right end region comprising the necessary E4 gene functions and the 
3 J inverted terminal repeat (ITRs), 

wherein the resulting chimeric adenovirus comprises 
adenoviral structural and regulatory proteins necessary for infection and replication. 

14. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus further comprises the Ilia, 52/55kDa and terminal protein (pTP) of the 
selected adenovirus serotype. 

15. The chimeric adenovirus according to claim 13, wherein chimeric 
adenovirus comprises the polymerase of the first adenovirus. 

16. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus expresses a functional chimeric protein formed from the first adenovirus 
and the selected adenovirus, said chimeric protein is selected from the group 
consisting of polymerase, terminal protein, 52/55 kDa protein, and Ilia. 
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17. The chimeric adenovirus according to claim 13, wherein the chimeric 
adenovirus comprises the terminal protein, 52/55 kDa, and/or Ilia of the selected 
adenovirus. 

1 8. A host cell comprising a chimeric adenovirus according to claim 12. 

19. The host cell according to claim 18, wherein said host cell is a human 

cell. 

20. An isolated simian adenovirus nucleic acid sequence selected from the 
group consisting of: 

(a) SA18 having the sequence of nucleic acids 1 to 31967 of SEQ ID 

NO: 12 and 

(b) a nucleic acid sequence complementary to the sequence of any of 

(a) to (f). 

21. An isolated simian adenovirus serotype nucleic acid sequence selected 
from one or more of the group consisting of: 

(a) 5' inverted terminal repeat (ITR) sequences; 

(b) the adenovirus El a region, or a fragment thereof selected 
from among the 13S, 12S and 9S regions; 

(c) the adenovirus Elb region, or a fragment thereof selected 
from among the group consisting of the small T, large T, DC, and IVa2 regions; 

(d) the E2b region; 

(e) the LI region, or a fragment thereof selected from among 
the group consisting of the 28.1 kD protein, polymerase, agnoprotein, 52/55 kD 
protein, and ma protein; 

(f) the L2 region, or a fragment thereof selected from the 
group consisting of the penton, VII, VI, and Mu proteins; 

(g) the L3 region, or a fragment thereof selected from the 
group consisting of the VI, hexon, or endoprotease; 

(h) the 2a protein; 
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(i) the L4 region, or a fragment thereof selected from the 
group consisting of the 100 kD protein, the 33 kD homolog, and VIII; 

(j) the E3 region, or a fragment thereof selected from the 
group consisting of E3 ORF1, E3 ORF2, E3 ORF3, E3 ORF4, E3 ORf 5, E3 ORF6, 
E3 ORF7, E3 ORF8, and E3 ORF9; 

(k) the L5 region, or a fragment thereof selected from a fiber 

protein; 

(1) the E4 region, or a fragment thereof selected from the 
group consisting of E4 ORF7, E4 ORF6, E4 ORF4, E4 ORF3, E4 ORF2, and E4 
ORFl;and 

(m) the 3' ITR, of any of SA18 SEQ ID NO:12, or a sequence 
complementary to any of (a) to (m). 

22. A simian adenovirus protein encoded by the nucleic acid sequence 
according to claim 21 . 

23. A composition comprising a simian adenovirus capsid protein according 
to claim 22 linked to a heterologous molecule for delivery to a selected host cell. 

24. A method for targeting a cell having an adenoviral receptor comprising 
delivering to a subject a composition according to claim 23. 

25. A nucleic acid molecule comprising a heterologous simian adenoviral 
sequence according to claim 21. 

26. The nucleic acid molecule according to claim 25, wherein said simian 
adenoviral sequence encodes an adenoviral gene product and is operatively linked to 
regulatory control sequences which direct expression of the adenoviral gene product 
in a host cells. 

27. The nucleic acid molecule according to claim 25, wherein said simian 
adenoviral sequence comprises the El a region of SA18 SEQ ID NO: 12. 
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28. A pharmaceutical composition comprising the nucleic acid molecule 
according to claim 27 and a physiologically compatible carrier. 

29. A recombinant adenovirus having a capsid comprising a protein selected 
from the group consisting of: 

(a) a hexon protein of SA18, SEQ ED NO 13, or a unique fragment 

thereof; 

(b) a penton protein of SA1 8, SEQ ID NO: 14, or a unique fragment 

thereof; 

(c) a fiber protein of SA1 8, SEQ ID NO: 1 5, or a unique fragment 

thereof. 

30. The recombinant adenovirus according to claim 29, wherein the capsid is 
of an artificial serotype. 

3 1 . The recombinant adenovirus according to claim 29, wherein said virus 
further comprises a heterologous gene operatively linked to sequences which direct 
expression of said gene in a host cell. 

32. The recombinant adenovirus according to claim 29, further comprising 
5' and 3' adenovirus cis-elements necessary for replication and encapsidation. 

33. The recombinant adenovirus according to claim 29, wherein said vector 
lacks all or a part of the El gene. 

34. A host cell comprising a heterologous nucleic acid molecule comprising 
the nucleic acid sequence according to claim 21. 

35. The host cell according to claim 34, wherein said host cell is stably 
transformed with the nucleic acid molecule. 
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36. The host cell according to claim 34, wherein said host cell expresses one 
or more adenoviral gene products from said nucleic acid molecule, said adenoviral 
gene products selected from the group consisting of Ela, Elb, E2a, and E4 ORF6. 

37. The host cell according to claim 34, wherein said host cell is stably 
transformed with a nucleic acid molecule comprising the simian adenovirus inverted 
terminal repeats. 

38. A composition comprising a recombinant virus according to claim 29 in 
a pharmaceutical^ acceptable carrier. 

39. A method for delivering a heterologous gene to a mammalian cell 
comprising introducing into said cell an effective amount of the recombinant virus 
according to claim 29. 

40. A method for repeat administration of a heterologous gene to a mammal 
comprising the steps of: 

(a) introducing into said mammal a first vector which 
comprises the heterologous gene and 

(b) introducing into said mammal a second vector which 
comprises the heterologous gene; 

wherein at least the first virus or the second vector is a 
virus according to claim 29 and wherein the first and 
second recombinant vector are different. 

41 . A method for producing a selected gene product comprising infecting a 
mammalian cell with the recombinant virus according to claim 29, culturing said cell 
under suitable conditions and recovering from said cell culture the expressed gene 
product. 

42. A method for eliciting an immune response in a mammalian host against 
an infective agent comprising administering to said host an effective amount of the 
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recombinant adenovirus of claim 29, wherein said heterologous gene encodes an 
antigen of the infective agent. 

43. The method according to claim 42, comprising the step of priming the 
host with a DNA vaccine comprising the heterologous gene prior to administering 
the recombinant adenovirus. 
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gcgacgacgc 


gacggttgac 


gtcctggatc 


tgacgcctct 


gggtgaaggc 


cacgggaccc 


8640 


gtgagtttga 


acctgaaaga 


gagttcgaca 


gaatcaatct 


cggtatcgtt 


gacggcggcc 


8700 


tgccgcagga 


tctcttgcac 


gtcgcccgag 


ttgtcctggt 


aggcgatctc 


ggtcatgaac 


8760 


tgctcgatct 


cctcctcctg 


aaggtctccg 


cgaccggcgc 


gctcgacggt 


ggccgcgagg 


8820 


tcgttggaga 


tgcggcccat 


gagctgcgag 


aaggcgttca 


tgccggcctc 


gttccagacg 


8880 


cggctgtaga 


ccacggctcc 


gtcggggtcg 


cgcgcgcgca 


tgaccacctg 


ggcgaggttg 


8940 


agctcgacgt 


ggcgcgtgaa 


gaccgcgtag 


ttgcagaggc 


gctggtagag 


gtagttgagc 


9000 


gtggtggcga 


tgtgctcggt 


gacgaagaag 


tacatgatcc 


agcggcggag 


cggcatctcg 


9060 


ctgacgtcgc 


ccagggcttc 


caagcgctcc 


atggcctcgt 


agaagtccac 


ggcgaagttg 


9120 


aaaaactggg 


agttgcgcgc 

3 3 3 3 


cgagacggtc 


aactcctcct 


ccagaagacg 


gatgagctcg 


9180 


gcgatggtgg 


cgcgcacctc 


gcgctcgaag 


gccccggggg 


gctcctcttc 


ttccatctcc 


9240 


tcctcctctt 


ccatctcctc 


cactaacatc 


tcttctactt 


cctcctcagg 


aggcggcggc 


9300 


gggggagggg 


ccctgcgtcg 


ccggcggcgc 


acgggcagac 


ggtcgatgaa 


gcgctcgatg 


9360 


gtctccccgc 


gccggcgacg 


catggtctcg 


gtgacggcgc 


gcccgtcctc 


gcggggccgc 


9420 


agcgtgaaga 


cgccgccgcg 


catctccagg 


tggccgccgg 
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9480 
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gagagggcgc 


tgacgatgca 


tcttatcaat 


tggcccgtag 


ggactccgcg 


caaggacctg 


9540 


agcgtctcga 


gatccacggg 


atccgaaaac 


cgctgaacga 


aggcttcgag 


ccagtcgcag 


9600 


tcgcaaggta 


ggctgagccc 


ggtttcttgt 


tcttcgggta 


tttggtcggg 


aggcgggcgg 


9660 


gcgatgctgc 


tggtgatgaa 


gttgaagtag 


gcggtcctga 


gacggcggat 


ggtggcgagg 


9720 


agcaccaggt 


ccttgggccc 


ggcttgctgg 


atgcgcagac 


ggtcggccat 


gccccaggcg 


9780 


tggtcctgac 


acctggcgag 


gtccttgtag 


tagtcctgca 


tgagccgctc 


cacgggcacc 


9840 


tcctcctcgc 


ccgcgcggcc 


gtgcatgcgc 


gtgagcccga 


acccgcgctg 


cggctggacg 


9900 


agcgccaggt 


cggcgacgac 


gcgctcggcg 


aggatggcct 


gctggatctg 


ggtgagggtg 


9960 


gtctggaagt 


cgtcgaagtc 


gacgaagcgg 


tggtaggctc 


cggtgttgat 


ggtgtaggag 


10020 


cagttggcca 


tgacggacca 


gttgacggtc 


tggtggccgg 


ggcgcacgag 


ctcgtggtac 


10080 


ttgaggcgcg 


agtaggcgcg 


cgtgtcgaag 


atgtagtcgt 


tgcaggtgcg 


cacgaggtac 


10140 


tggtatccga 


cgaggaagtg 


cggcggcggc 


tggcggtaga 


gcggccatcg 


ctcggtggcg 


10200 


ggggcgccgg 


gcgcgaggtc 


ctcgagcatg 


aggcggtggt 


agccgtagat 


gtacctggac 


10260 


atccaggtga 


tgccggcggc 


ggtggtggag 


gcgcgcggga 


actcgcggac 


gcggttccag 


10320 


atgttgcgca 


gcggcaggaa 


gtagttcatg 


gtggccgcgg 


tctggcccgt 


gaggcgcgcg 


10380 


cagtcgtgga 


tgctctagac 


atacgggcaa 


aaacgaaagc 


ggtcagcggc 


tcgactccgt 


10440 


ggcctggagg 


ctaagcgaac 


gggttgggct 


gcgcgtgtac 


cccggttcga 


gtccctgctc 


10500 


gaatcaggct 


ggagccgcag 


ctaacgtggt 


actggcactc 


ccgtctcgac 


ccaagcctgc 


10560 


taacgaaacc 


tccaggatac 


ggaggcgggt 


cgttttggcc 


attttcgtca 


ggccggaaat 


10620 


gaaactagta 


agcgcggaaa 


gcggccgtcc 


gcgatggctc 


gctgccgtag 


tctggagaaa 


10680 


gaatcgccag 


ggttgcgttg 


cggtgtgccc 


cggttcgagc 


ctcagcgctc 


ggcgccggcc 


10740 


ggattccgcg 


gctaacgtgg 


gcgtggctgc 


cccgtcgttt 


ccaagacccc 


ttagccagcc 


10800 


gacttctcca 


gttacggagc 


gagcccctct 


ttttcttgtg 


tttttgccag 


atgcatcccg 


10860 


tactgcggca 


gatgcgcccc 


caccctccac 


cacaaccgcc 


cctaccgcag 


cagcagcaac 


10920 


agccggcgct 


tctgcccccg 


ccccagcagc 


agcagccagc 


cactaccgcg 


gcggccgccg 


10980 


tgagcggagc 


cggcgttcag 


tatgacctgg 


ccttggaaga 


gggcgagggg 


ctggcgcggc 


11040 


tgggggcgtc 


gtcgccggag 


cggcacccgc 


gcgtgcagat 


gaaaagggac 


gctcgcgagg 


11100 


cctacgtgcc 


caagcagaac 


ctgttcagag 


acaggagcgg 


cgaggagccc 


gaggagatgc 


11160 


gcgcctcccg 


cttccacgcg 


gggcgggagc 


tgcggcgcgg 


cctggaccga 


aagcgggtgc 


11220 


tgagggacga 


ggatttcgag 


gcggacgagc 


tgacggggat 


cagccccgcg 


cgcgcgcacg 


11280 


tggccgcggc 


caacctggtc 


acggcgtacg 


agcagaccgt 


gaaggaggag 


agcaacttcc 


11340 


aaaaatcctt 


caacaaccac 


gtgcgcacgc 


tgatcgcgcg 


cgaggaggtg 


accctgggcc 


11400 


tgatgcacct 


gtgggacctg 


ctggaggcca 


tcgtgcagaa 


ccccacgagc 


aagccgctga 


11460 


cggcgcagct 


gtttctggtg 


gtgcagcaca 


gtcgggacaa 
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cgagacgttc 


agggaggcgc 
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tactaaatat 


caccgagccc 


qaqqqccqct 

y^yyy^^y* 


ggctcctgga 


cctggtgaac 


attctgcaga 


11580 


acatcQtaat 


qcaqqaqcqc 


qqqctqccqc 

333 w *-3 ^" 3 ^" 


tgtccgagaa 


gctggcggcc 


atcaacttct 


11640 


caatactaaa 


cctqqqcaaq 


tactacgcta 


ggaagatcta 


caagaccccg 


tacgtgccca 


11700 


taaacaaaqa 


ggtgaagatc 


qacqqqtttt 

3***-333 *- *• *> 


acatgcgcat 


gaccctgaaa 


gtgctgaccc 


11760 


taaacaacaa 


tctqqqqqtq 


taccgcaacg 


acaggatgca 


ccgcgcggtg 

^^"3 3 33 3 


agcgccagcc 


11820 


accaacacaa 


qctqaqcqac 

3 3 3 3 


caqqaqctqa 


tgcacagcct 


gcagcgggcc 


ctgaccgggg 


11880 


ccaaaaccaa 

V* V* %J *7 *A ^» *A 


qqqqqaqaqc 


tactttgaca 


tgggcgcgga 

**333 3 33 


cctgcgctgg 

3 3 33 


cagcctagcc 


11940 


accaaacctt 


qqaaqctqcc 

33 3 *- 3 


qqcqqttccc 

33^-33 w w*-w^» 


cctacgtgga 


ggaggtggac 

33 33 33 


gatgaggagg 


12000 


aaaaaaacaa 

w yy u yyy w y u 


gtacctggaa 


qactqatqqc 


gcgaccgtat 


ttttgctaga 


tgcagcaaca 


12060 


accaccQcca 


cctcctgatc 


ccqcqatqcq 


ggcggcgctg 

33^"33*"3*" *- 3 


cagagccagc 


cgtccggcat 


12120 


taactcctcg 


qacqattqqa 

3 ^* 3^^ 3 3^^ 


cccaggccat 


gcaacgcatc 


atggcgctga 


cgacccgcaa 


12180 


tcccgaagcc 


tttagacagc 


aqcctcaqqc 

27 23 27 


caaccgactc 


tcggccatcc 


tggaggccgt 

33 3 


12240 


qqtqccctca 

3 3 3 3 


cgctcgaacc 


ccacgcacga 


gaaggtgctg 

3 33 3 3 


gccatcgtga 


acgcgctggt 


12300 


ggagaacaag 


qccatccqcq 


qcgacgaggc 


cgggctggtg 

333 33 ^3 


tacaacgcgc 


tgctggagcg 

3 33 3 3 


12360 


cqtqqcccqc 

J k -33 v * v "^"3 


tacaacagca 


ccaacgtgca 


gacgaacctg 


gaccgcatgg 

3 3 *33 


tgaccgacgt 


12420 


qcqcqaqqcq 


qtqtcqcagc 


gcgagcggtt 

3^"3 3^""33 *■ *■ 


ccaccgcgag 


tcgaacctgg 


gctccatggt 


12480 


qqcqctqaac 


gccttcctga 


qcacqcaqcc 

27 ^* 27 23 


cgccaacgtg 


ccccgqggcc 


aggaggacta 


12540 


caccaacttc 


atcagcgcgc 


tqcqqctqat 

»3 3 3 3 


qqtqqccqaq 

3 3 w 33 % ~*"3**3 


gtgccccaga 


gcgaggtgta 

3 V 3**33 3 


12600 


ccaqtcqqqq 

3 uw yyzjy 


ccggactact 


tcttccagac 


cagtcgccag 


ggcttgcaga 


ccgtgaacct 


12660 


gagccaggct 


ttcaagaact 


tqcaqqqact 


gtggggcgtg 

3 3 **3 


caggccccgg 


tcggggaccg 

3333 3 


12720 


cqcqacqqtq 


tcqaqcctqc 


tgacgccgaa 


ctcgcgcctg 


ctgctgctgc 


tggtggcgcc 

33 33 3 


12780 


cttcacggac 


agcggcagcg 

3 33 3 3 


tgagccgcga 

**3*^3*"^*3^"3*** 


ctcgtacctg 


ggctacctgc 


ttaacctgta 


12840 


ccqcqaqqcc 


atcqqqcaqq 

w 333 *"*33 


cgcacgtgga 

3 3 fc 33 w 


cgagcagacc 


taccaggaga 


tcacccacgt 


12900 


qaqccqcqcq 

3 3 ^ *-3 *-3 *-3 


ctqqgccagg 


aggacccggg 

33**^"^*^"333 


caacctggag 


gccaccctga 


acttcctgct 


12960 


gaccaaccgg 


tcqcagaaqa 


tcccgcccca 


gtacgcgctg 


agcaccgagg 


aggagcgcat 


13020 


cctgcgctac 


qtqcaqcaqa 

23 w 27 2? 2? 


gcqtggggct 

3*"3 w 3333 vw 


gttcctgatg 


caggaggggg 

^**3 3»*33333 


ccacgcccag 


13080 


cgccgcgctc 


gacatgaccg 


cgcgcaacat 


ggagcccagc 


atgtacgccc 


gcaaccgccc 


13140 


gttcatcaat 


aagctgatgg 


actacttgca 


tcqqqcqqcc 

333 33 


gccatgaact 


cggactactt 


13200 


taccaacgcc 


atcttgaacc 


cgcactggct 


cccgccgccc 


gggttctaca 


cgggcgagta 

333 3 3 


13260 


cgacatgccc 


gaccccaacg 


acgggttcct 


gtgggacgac 


gtggacagca 


gcgtgttctc 


13320 


gccqcqcccc 


accaccacca 


ccgtgtggaa 


qaaaqagggc 

3** v ***3**333* - 


ggqgaccqgc 

332»3 , * v,v, 33*' 


qqccqtcctc 


13380 


ggcgctgtcc 


ggtcgcgcgg 

33 3^*3 33 


gtgctgccgc 

3 3 ^ *"3 3 


ggcggtgccc 

33*-33 w 3 ^* w ^ 


gaggccqcca 


gccccttccc 


13440 


gagcctgccc 


ttttcgctga 


acagcgtgcg 


cagcagcgag 


ctgggtcggc 


tgacgcggcc 


13500 


gcgcctgctg 


ggcgaggagg 


agtacctgaa 


cgactccttg 


cttcggcccg 


agcgcgagaa 


13560 


gaacttcccc 


aataacggga 


tagagagcct 


ggtggacaag 
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atgagccgct 


ggaagacgta 


13620 
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cgcgcacgag 


cacagggacg 


agccccgagc 


tagcagcagc 


accggcgcca 


cccgtagacg 


13680 


ccagcggcac 


gacaggcagc 


ggggtctggt 


gtgggacgat 


gaggattccg 


ccgacgacag 


13740 


cagcgtgttg 


gacttgggtg 


ggagtggtgg 


tggtaacccg 


ttcgctcacc 


tgcgcccccg 


13800 


tatcgggcgc 


ctgatgtaag 


aatctgaaaa 


aataaaagac 


ggtactcacc 


aaggccatgg 


13860 


cgaccagcgt 


gcgttcttct 


ctgttgtttg 


tagtagtatg 


atgaggcgcg 


tgtacccgga 


13920 


gggtcctcct 


ccctcgtacg 


agagcgtgat 


gcagcaggcg 


gtggcggcgg 


cgatgcagcc 


13980 


cccgctggag 


gcgccttacg 


tgcccccgcg 


gtacctggcg 


cctacggagg 


ggcggaacag 


14040 


cattcgttac 


tcggagctgg 


cacccttgta 


cgataccacc 


cggttgtacc 


tggtggacaa 


14100 


caagtcggcg 


gacatcgcct 


cgctgaacta 


ccagaacgac 


cacagcaact 


tcctgaccac 


14160 


cgtggtgcag 


aacaacgatt 


tcacccccac 


ggaggccagc 


acccagacca 


tcaactttga 


14220 


cgagcgctcg 


cggtggggcg 


gccagctgaa 


aaccatcatg 


cacaccaaca 


tgcccaacgt 


14280 


gaacgagttc 


atgtacagca 


acaagttcaa 


ggcgcgggtg 


atggtctcgc 


gcaagacccc 


14340 


caacggggtc 


acagtaacag 


atggtagtca 


ggacgagctg 


acctacgagt 


gggtggagtt 


14400 


tgagctgccc 


gagggcaact 


tctcggtgac 


catgaccatc 


gatctgatga 


acaacgccat 


14460 


catcgacaac 


tacttggcgg 


tggggcggca 


gaacggggtg 


ctggagagcg 


acatcggcgt 


14520 


gaagttcgac 


acgcgcaact 


tccggctggg 


ctgggacccc 


gtgaccgagc 


tggtgatgcc 


14580 


gggcgtgtac 


accaacgagg 


ccttccaccc 


cgacatcgtc 


ctgctgcccg 


gctgcggcgt 


14640 


ggacttcacc 


gagagccgcc 


tcagcaacct 


gctgggcatc 


cgcaagcggc 


agcccttcca 


14700 


ggagggcttc 


cagatcctgt 


acgaggacct 


ggaggggggc 


aacatccccg 


cgctgctgga 


14760 


cgtggacgcc 


tacgagaaaa 


gcaaggagga 


tagcgccgcc 


gcggcgaccg 


cagccgtggc 


14820 


caccgcctct 


accgaggtgc 


ggggcgataa 


ttttgctagc 


gccgcgacac 


tggcagcggc 


14880 


cgaggcggct 


gaaaccgaaa 


gtaagatagt 


gatccagccg 


gtggagaagg 


acagcaagga 


14940 


gaggagctac 


aacgtgctcg 


cggacaagaa 


aaacaccgcc 


taccgcagct 


ggtacctggc 


15000 


ctacaactac 


ggcgaccccg 


agaagggcgt 


gcgctcctgg 


acgctgctca 


ccacctcgga 


15060 


cgtcacctgc 


ggcgtggagc 


aagtctactg 


gtcgctgccc 


gacatgatgc 


aagacccggt 


15120 


caccttccgc 


tccacgcgtc 


aagttagcaa 


ctacccggtg 


gtgggcgccg 


agctcctgcc 


15180 


cgtctactcc 


aagagcttct 


tcaacgagca 


ggccgtctac 


tcgcagcagc 


tgcgcgcctt 


15240 


cacctcgctc 


acgcacgtct 


tcaaccgctt 


ccccgagaac 


cagatcctcg 


ttcgcccgcc 


15300 


cgcgcccacc 


attaccaccg 


tcagtgaaaa 


cgttcctgct 


ctcacagatc 


acgggaccct 


15360 


gccgctgcgc 


agcagtatcc 


ggggagtcca 


gcgcgtgacc 


gtcactgacg 


ccagacgccg 


15420 


cacctgcccc 


tacgtctaca 


aggccctggg 


cgtagtcgcg 


ccgcgcgtcc 


tctcgagccg 


15480 


caccttctaa 


aaaatgtcca 


ttctcatctc 


gcccagtaat 


aacaccggtt 


ggggcctgcg 


15540 


cgcgcccagc 


aagatgtacg 


gaggcgctcg 


ccaacgctcc 


acgcaacacc 


ccgtgcgcgt 


15600 


gcgcgggcac 


ttccgcgctc 


cctggggcgc 


cctcaagggc 
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cgcgtgcgct 
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cgtcgacgac 


gtgatcgacc 


aggtggtggc 


cgacgcgcgc 


aactacacgc 


ccgccgccgc 


15720 


gcccgtctcc 


accgtggacg 


ccgtcatcga 


cagcgtggtg 


gccgacgcgc 


gccggtacgc 


15780 


ccgcgccaag 


agccggcggc 


ggcgcatcgc 


ccggcggcac 


cggagcaccc 


ccgccatgcg 


15840 


cgcggcgcga 


gccttgctgc 


gcagggccag 


gcgcacggga 


cgcagggcca 


tgctcagggc 


15900 


ggccagacgc 

ZJ ZJ 9 *J 


gcggcctccg 


gcagcagcag 


cgccggcagg 


acccgcagac 


gcgcggccac 


15960 


ggcggcggcg 

33 ZJ ZJ S ZJ 9 


gcggccatcg 


ccagcatgtc 


ccgcccgcgg 


cgcggcaacg 


tgtactgggt 


16020 


gcgcgacgcc 


gccaccggtg 


tgcgcgtgcc 


cgtgcgcacc 


cgcccccctc 


gcacttgaag 


16080 


atgctgactt 

ZJ 9 


cgcgatgttg 


atgtgtccca 


gcggcgagga 


ggatgtccaa 


gcgcaaattc 


16140 


aaggaagaga 


tgctccaggt 


catcgcgcct 


gagatctacg 


gcccggcggc 


ggtgaaggag 


16200 


gaaagaaagc 


cccgcaaact 


gaagcgggtc 


aaaaaggaca 


aaaaggagga 


ggaagatgtg 


16260 


gacggactgg 

3 ZJ ZJ ZJ ZJ 


tggagtttgt 


gcgcgagttc 


gccccccggc 


ggcgcgtgca 


gtggcgcggg 


16320 


cggaaagtga 


aaccggtgct 


gcgacccggc 


accacggtgg 


tcttcacgcc 


cggcgagcgt 


16380 


tccggctccg 


cctccaagcg 


ctcctacgac 


gaggtgtacg 


gggacgagga 


catcctcgag 


16440 


caggcggccg 


aacgtctggg 


cgagtttgct 


tacggcaagc 


gcagccgccc 


cgcgcccttg 


16500 


aaagaggagg 


cggtgtccat 


cccgctggac 


cacggcaacc 


ccacgccgag 


cctgaagccg 


16560 


gtgaccctgc 


agcaggtgct 


gcctggtgcg 


gcgccgcgcc 


ggggcttcaa 


gcgcgagggc 


16620 


ggcgaggatc 


tgtacccgac 


catgcagctg 


atggtgccca 


agcgccagaa 


gctggaggac 


16680 


gtgctggagc 


acatgaaggt 


ggaccccgag 


gtgcagcccg 


aggtcaaggt 


gcggcccatc 


16740 


aagcaggtgg 


ccccgggcct 


QQQcgtgcag 


accgtggaca 


tcaagatccc 


cacggagccc 


16800 


atggaaacgc 


agaccgagcc 


cgtgaagccc 


agcaccagca 


ccatggaggt 


gcagacggat 


16860 


ccctggatgc 


cggcaccggc 


ttccaccacc 


cgccgaagac 


gcaagtacgg 


cgcggccagc 


16920 


ctgctgatgc 


ccaactacgc 


gctgcatcct 


tccatcatcc 


ccacgccggg 


ctaccgcggc 


16980 


acgcgcttct 


accgcggcta 


caccagcagc 


cgccgccgca 


agaccaccac 


ccgccgccgc 


17040 


cgtcgtcgca 


cccgccgcag 


cagcaccgcg 


acttccgccg 


ccgccctggt 


gcggagagtg 


17100 


taccgcagcg 


ggcgcgagcc 


tctgaccctg 


ccgcgcgcgc 


gctaccaccc 


gagcatcgcc 


17160 


atttaactac 


cgcctcctac 


ttgcagatat 


ggccctcaca 


tgccgcctcc 


gcgtccccat 


17220 


tacgggctac 


cgaggaagaa 


agccgcgccg 


tagaaggctg 


acggggaacg 


ggctgcgtcg 


17280 


ccatcaccac 


cggcggcggc 


gcgccatcag 


caagcggttg 


gggggaggct 


tcctgcccgc 


17340 


gctgatgccc 


atcatcgccg 


cggcgatcgg 


ggcgatcccc 


ggcatagctt 


ccgtggcggt 


17400 


gcaggcctct 


cagcgccact 


gagacacagc 


ttggaaaatt 


tgtaataaaa 


aatggactga 


17460 


cgctcctggt 


cctgtgatgt 


gtgtttttag 


atggaagaca 


tcaatttttc 


gtccctggca 


17520 


ccgcgacacg 


gcacgcggcc 


gtttatgggc 


acctggagcg 


acatcggcaa 


cagccaactg 


17580 


aacgggggcg 


ccttcaattg 


gagcagtctc 


tggagcgggc 


ttaagaattt 


cgggtccacg 


17640 


ctcaaaacct 


atggcaacaa 


ggcgtggaac 


agcagcacag 


ggcaggcgct 


gagggaaaag 


17700 


ctgaaagagc 


agaacttcca 


gcagaaggtg 


gtcgatggcc 
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tggcctcggg 


catcaacggg 


17760 
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gtggtggacc 


tggccaacca 


ggccgtgcag 


aaacagatca 


acagccgcct 


ggacgcggtc 


17820 


ccgcccgcgg 


ggtccgtgga 


gatgccccag 


gtggaggagg 


agctgcctcc 


cctggacaag 


17880 


cgcggcgaca 


agcgaccgcg 


tcccgacgcg 


gaggagacgc 


tgctgacgca 


cacggacgag 


17940 


ccgcccccgt 


acgaggaggc 


ggtgaaactg 


ggtctgccca 


ccacgcggcc 


cgtggcgcct 


18000 


ctggccaccg 


gggtgctgaa 


acccagcagc 


agcagcagcc 


agcccgcgac 


cctggacttg 


18060 


cctccgcctg 


cttcccgccc 


ctccacagtg 


gctaagcccc 


tgccgccggt 


ggccgtcgcg 


18120 


tcgcgcgccc 


cccgaggccg 


cccccaggcg 


aactggcaga 


gcactctgaa 


cagcatcgtg 


18180 


ggtctgggag 


tgcagagtgt 


gaagcgccgc 


cgctgctatt 


aaaagacact 


gtagcgctta 


18240 


acttgcttgt 


ctgtgtgtat 


atgtatgtcc 


gccgaccaga 


aggaggagga 


agaggcgcgt 


18300 


cgccgagttg 


caagatggcc 


accccatcga 


tgctgcccca 


gtgggcgtac 


atgcacatcg 


18360 


ccggacagga 


cgcttcggag 


tacctgagtc 


cgggtctggt 


gcagttcgcc 


cgcgccacag 


18420 


acacctactt 


cagtctgggg 


aacaagttta 


ggaaccccac 


ggtggcgccc 


acgcacgatg 


18480 


tgaccaccga 


ccgcagccag 


cggctgacgc 


tgcgcttcgt 


gcccgtggac 


cgcgaggaca 


18540 


acacctactc 


gtacaaagtg 


cgctacacgc 


tggccgtggg 


cgacaaccgc 


gtgctggaca 


18600 


tggccagcac 


ctactttgac 


atccgcggcg 


tgctggatcg 


gggccctagc 


ttcaaaccct 


18660 


actccggcac 


cgcttacaac 


agcctggctc 


ccaagggagc 


gcccaacact 


tgccagtgga 


18720 


catataaagc 


tgatggtgat 


actggtacag 


aaaaaaccta 


tacatatgga 


aatgcgcctg 


18780 


tgcaaggcat 


tagtattaca 


aaagatggta 


ttcaacttgg 


aactgacact 


gatgatcagc 


18840 


ccatttatgc 


agataaaact 


tatcaaccag 


agcctcaagt 


gggtgatgct 


gaatggcatg 


18900 


acatcactgg 


tactgatgaa 


aaatatggag 


gcagagctct 


caagcctgac 


accaaaatga 


18960 


agccctgcta 


tggttctttt 


gccaagccta 


ccaataaaga 


aggaggtcag 


gcaaatgtga 


19020 


aaaccgaaac 


aggcggtacc 


aaagaatatg 


acattgacat 


ggcattcttc 


gataatcgaa 


19080 


gtgcagctgc 


ggctggcctg 


gccccagaaa 


ttgttttgta 


tactgagaat 


gtggatctgg 


19140 


aaactccaga 


tactcatatt 


gtatacaagg 


cgggcacaga 


tgacagcagc 


tcttctatca 


19200 


atttgggtca 


gcagtccatg 


cccaacagac 


ccaactacat 


tggctttaga 


gacaacttta 


19260 


tcgggctcat 


gtactacaac 


agcactggca 


acatgggcgt 


gctggctggt 


caggcctccc 


19320 


agctgaatgc 


tgtggtggac 


ttgcaggaca 


gaaacactga 


actgtcctac 


cagctcttgc 


19380 


ttgactctct 


gggcgacaga 


accaggtatt 


tcagtatgtg 


gaatcaggcg 


gtggacagct 


19440 


atgaccccga 


tgtgcgcatt 


attgaaaatc 


acggtgtgga 


ggatgaactc 


cctaactatt 


19500 


gcttccccct 


ggatgctgtg 


ggtagaactg 


atacttacca 


gggaattaag 


gccaatggtg 


19560 


ctgatcaaac 


cacctggacc 


aaagatgata 


ctgttaatga 


tgctaatgaa 


ttgggcaagg 


19620 


gcaatccttt 


cgccatggag 


atcaacatcc 


aggccaacct 


gtggcggaac 


ttcctctacg 


19680 


cgaacgtggc 


gctgtacctg 


cccgactcct 


acaagtacac 


gccggccaac 


atcacgctgc 


19740 


cgaccaacac 


caacacctac 


gattacatga 


acggccgcgt 


ggtggcgccc 


tcgctggtgg 


19800 
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acgcctacat 


caacatcggg 


gcgcgctggt 


cgctggaccc 


catggacaac 


gtcaacccct 


19860 


tcaaccacca 


ccgcaacgcg 


ggcctgcgct 


accgctccat 


gctcctgggc 


aacgggcgct 


19920 


acgtgccctt 


ccacatccag 


gtgccccaaa 


agttcttcgc 


catcaagagc 


ctcctgctcc 


19980 


tgcccgggtc 


ctacacctac 


gagtggaact 


tccgcaagga 


cgtcaacatg 


atcctgcaga 


20040 


gctccctcgg 


caacgacctg 


cgcacggacg 


gggcctccat 


cgccttcacc 


agcatcaacc 


20100 


tctacgccac 


cttcttcccc 


atggcgcaca 


acaccgcctc 


cacgctcgag 


gccatgctgc 


20160 


gcaacgacac 


caacgaccag 


tccttcaacg 


actacctctc 


ggcggccaac 


atgctctacc 


20220 


ccatcccggc 


caacgccacc 


aacgtgccca 


tctccatccc 


ctcgcgcaac 


tgggccgcct 


20280 


tccgcggatg 


gtccttcacg 


cgcctcaaga 


cccgcgagac 


gccctcgctc 


ggctccgggt 


20340 


tcgaccccta 


cttcgtctac 


tcgggctcca 


tcccctacct 


cgacggcacc 


ttctacctca 


20400 


accacacctt 


caagaaggtc 


tccatcacct 


tcgactcctc 


cgtcagctgg 


cccggcaacg 


20460 


accgcctcct 


gacgcccaac 


gagttcgaaa 


tcaagcgcac 


cgtcgacgga 


gaggggtaca 


20520 


acgtggccca 


gtgcaacatg 


accaaggact 


ggttcctggt 


ccagatgctg 


gcccactaca 


20580 


acatcggcta 


ccagggcttc 


tacgtgcccg 


agggctacaa 


ggaccgcatg 


tactccttct 


20640 


tccgcaactt 


ccagcccatg 


agccgccagg 


tcgtggacga 


ggtcaactac 


aaggactacc 


20700 


aggccgtcac 


cctggcctac 


cagcacaaca 


actcgggctt 


cgtcggctac 


ctcgcgccca 


20760 


ccatgcgcca 


gggacagccc 


taccccgcca 


actaccccta 


cccgctcatc 


ggcaagagcg 


20820 


ccgtcgccag 


cgtcacccag 


aaaaagttcc 


tctgcgaccg 


ggtcatgtgg 


cgcatcccct 


20880 


tctccagcaa 


cttcatgtcc 


atgggcgcgc 


tcaccgacct 


cggccagaac 


atgctctacg 


20940 


ccaactccgc 


ccacgcgcta 


gacatgaatt 


tcgaagtcga 


ccccatggat 


gagtccaccc 


21000 


ttctctatgt 


tgtcttcgaa 


gtcttcgacg 


tcgtccgagt 


gcaccagccc 


caccgcggcg 


21060 


tcatcgaggc 


cgtctacctg 


cgcacgccct 


tctcggccgg 


caacgccacc 


acctaagccc 


21120 


cgctcttgct 


tcttgcaaga 


tgacggcctg 


tgcgggctcc 


ggcgagcagg 


agctcagggc 


21180 


catcctccgc 


gacctgggct 


gcgggccctg 


cttcctgggc 


accttcgaca 


agcgcttccc 


21240 


gggattcatg 


gccccgcaca 


agctggcctg 


cgccatcgtc 


aacacggccg 


gccgcgagac 


21300 


cgggggcgag 


cactggctgg 


ccttcgcctg 


gaacccgcgc 


tcccacacct 


gctacctctt 


21360 


cgaccccttc 


gggttctcgg 


acgagcgcct 


caagcagatc 


taccagttcg 


agtacgaggg 


21420 


cctgctgcgc 


cgcagcgccc 


tggccaccga 


ggaccgctgc 


gtcaccctgg 


aaaagtccac 


21480 


ccagaccgtg 


cagggtccgc 


gctcggccgc 


ctgcgggctc 


ttctgctgca 


tgttcctgca 


21540 


cgccttcgtg 


cactggcccg 


accgccccat 


ggacaagaac 


cccaccatga 


acttgctgac 


21600 


gggggtgccc 


aacggcatgc 


tccagtcgcc 


ccaggtggaa 


cccaccctgc 


gccgcaacca 


21660 


ggaggcgctc 


taccgcttcc 


tcaacgccca 


ctccgcctac 


tttcgctccc 


accgcgcgcg 


21720 


f*3 t"f na na a n 


c\cc7k rrnr r*t* 
yi.i_.cLi«i_yi_i_ l 


trnarrnrat 


y ad LLddLjaL 


atfltaaarrn 
dL Ly LdddLLtj 


LCJXgi.yT.ciLy 




tgaatgcttt 


attcataata 


aacagcacat 


gtttatgcca 


ccttttctga 


ggctctgact 


21840 


ttatttagaa 


atcgaagggg 


ttctgccggc 


tctcggcgtg 


ccccgcgggc 


agggatacgt 


21900 
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tgcggaactg 


gtacttgggc 


agccacttga 


actcggggat 


cagcagcttc 


ggcacgggga 


21960 


ggtcggggaa 


cgagtcgctc 


cacagcttgc 


gcgtgagttg 


cagggcgccc 


agcaggtcgg 


22020 


gcgcggagat 


cttgaaatcg 


cagttgggac 


ccgcgttctg 


cgcgcgggag 


ttgcggtaca 


22080 


cggggttgca 


gcactggaac 


accatcaggg 


ccgggtgctt 


cacgctcgcc 


agcaccgtcg 


22140 


cgtcggtgat 


gccctccacg 


tccagatcct 


cggcgttggc 


catcccgaag 


ggggtcatct 


22200 


tgcaggtctg 


ccgccccatg 


ctgggcacgc 


agccgggctt 


gtggttgcaa 


tcgcagtgca 


22260 


gggggatcag 


catcatctgg 


gcctgctcgg 


agctcatgcc 


cgggtacatg 


gccttcatga 


22320 


aagcctccag 


ctggcggaag 


gcctgctgcg 


ccttgccgcc 


ctcggtgaag 


aagaccccgc 


22380 


aggacttgct 


agagaactgg 


ttggtggcgc 


agccggcgtc 


gtgcacgcag 


cagcgcgcgt 


22440 


cgttgttggc 


cagctgcacc 


acgctgcgcc 


cccagcggtt 


ctgggtgatc 


ttggcccggt 


22500 


cggggttctc 


cttcagcgcg 


cgctgcccgt 


tctcgctcgc 


cacatccatc 


tcgatcgtgt 


22560 


gctccttctg 


gatcatcacg 


gtcccgtgca 


ggcatcgcag 


cttgccctcg 


gcctcggtgc 


22620 


acccgtgcag 


ccacagcgcg 


cagccggtgc 


actcccagtt 


cttgtgggcg 


atctgggagt 


22680 


gcgagtgcac 


gaagccctgc 


aggaagcggc 


ccatcatcgt 


ggtcagggtc 


ttgttgctgg 


22740 


tgaaggtcag 


cgggatgccg 


cggtgctcct 


cgttcacata 


caggtggcag 


atgcggcggt 


22800 


acacctcgcc 


ctgctcgggc 


atcagctgga 


aggcggactt 


caggtcgctc 


tccacgcggt 


22860 


accggtccat 


cagcagcgtc 


atgacttcca 


tgcccttctc 


ccaggccgag 


acgatcggca 


22920 


ggctcagggg 


gttcttcacc 


gccgttgtca 


tcttagtcgc 


cgccgctgag 


gtcagggggt 


22980 


cgttctcgtc 


cagggtctca 


aacactcgct 


tgccgtcctt 


ctcggtgatg 


cgcacggggg 


23040 


gaaagctgaa 


gcccacggcc 


gccagctcct 


cctcggcctg 


cctttcgtcc 


tcgctgtcct 


23100 


ggctgatgtc 


ttgcaaaggc 


acatgcttgg 


tcttgcgggg 


tttctttttg 


ggcggcagag 


23160 


gcggcggcgg 


agacgtgctg 


ggcgagcgcg 


agttctcgct 


caccacgact 


atttcttctt 


23220 


cttggccgtc 


gtccgagacc 


acgcggcggt 


aggcatgcct 


cttctggggc 


agaggcggag 


23280 


gcgacgggct 


ctcgcggttc 


ggcgggcggc 


tggcagagcc 


ccttccgcgt 


tcgggggtgc 


23340 


gctcctggcg 


gcgctgctct 


gactgacttc 


ctccgcggcc 


ggccattgtg 


ttctcctagg 


23400 


gagcaacaag 


catggagact 


cagccatcgt 


cgccaacatc 


gccatctgcc 


cccgccgccg 


23460 


ccgacgagaa 


ccagcagcag 


aatgaaagct 


taaccgcccc 


gccgcccagc 


cccacctccg 


23520 


acgccgccgc 


ggccccagac 


atgcaagaga 


tggaggaatc 


catcgagatt 


gacctgggct 


23580 


acgtgacgcc 


cgcggagcac 


gaggaggagc 


tggcagcgcg 


cttttcagcc 


ccggaagaga 


23640 


accaccaaga 


gcagccagag 


caggaagcag 


agagcgagca 


gcagcaggct 


gggctcgagc 


23700 


atggcgacta 


cctgagcggg 


gcagaggacg 


tgctcatcaa 


gcatctggcc 


cgccaatgca 


23760 


tcatcgtcaa 


ggacgcgctg 


ctcgaccgcg 


ccgaggtgcc 


cctcagcgtg 


gcggagctca 


23820 


gccgcgccta 


cgagcgcaac 


ctcttctcgc 


cgcgcgtgcc 


ccccaagcgc 


cagcccaacg 


23880 


gcacctgcga 


gcccaacccg 


cgcctcaact 


tctacccggt 


cttcgcggtg 


cccgaggccc 


23940 
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tggccaccta ccacctcttt ttcaagaacc aaaggatccc cgtctcctgc cgcgccaacc 24000 

gcacccgcgc cgacgccctg ctcaacctgg gtcccggcgc ccgcctacct gatatcgcct 24060 

ccttggaaga ggttcccaag atcttcgagg gtctgggcag cgacgagact cgggccgcga 24120 

acgctctgca aggaagcgga gaggagcatg agcaccacag cgccctggtg gagttggaag 24180 

gcgacaacgc gcgcctggcg gtgctcaagc gcacggtcga gctgacccac ttcgcctacc 24240 

cggcgctcaa cctgcccccc aaggtcatga gcgccgtcat ggaccaggtg ctcatcaagc 24300 

gcgcctcgcc cctctcggat gaggacatgc aggaccccga gagctcggac gagggcaagc 24360 

ccgtggtcag cgacgagcag ctggcgcgct ggctgggagc gagtagcacc ccccagagct 24420 

tggaagagcg gcgcaagctc atgatggccg tggtcctggt gaccgtggag ctggagtgtc 24480 

tgcgccgctt cttcgccgac gcagagaccc tgcgcaaggt cgaggagaac ctgcactacc 24540 

tcttcaggca cgggtttgtg cgccaggcct gcaagatctc caacgtggag ctgaccaacc 24600 

tggtctccta catgggcatc ctgcacgaga accgcctggg gcagaacgtg ctgcacacca 24660 

ccctgcgcgg ggaggcccgc cgcgactaca tccgcgactg cgtctacctg tacctctgcc 24720 

acacctggca gacgggcatg ggcgtgtggc agcagtgcct ggaggagcag aacctgaaag 24780 

agctctgcaa gctcctgcag aagaacctga aggccctgtg gaccgggttc gacgagcgca 24840 

ccaccgcctc ggacctggcc gacctcatct tccccgagcg cctgcggctg acgctgcgca 24900 

acggactgcc cgactttatg agtcaaagca tgttgcaaaa ctttcgctct ttcatcctcg 24960 

aacgctccgg gatcctgccc gccacctgct ccgcgctgcc ctcggacttc gtgccgctga 25020 

ccttccgcga gtgccccccg ccgctctgga gccactgcta cctgctgcgc ctggccaact 25080 

acctggccta ccactcggac gtgatcgagg acgtcagcgg cgagggtctg ctcgagtgcc 25140 

actgccgctg caacctctgc acgccgcacc gctccctggc ctgcaacccc cagctgctga 25200 

gcgagaccca gatcatcggc accttcgagt tgcaaggccc cggcgagggc aaggggggtc 25260 

tgaaactcac cccggggctg tggacctcgg cctacttgcg caagttcgtg cccgaggact 25320 

accatccctt cgagatcagg ttctacgagg accaatccca gccgcccaag gccgaactgt 25380 

cggcctgcgt catcacccag ggggccatcc tggcccaatt gcaagccatc cagaaatccc 25440 

gccaagaatt tctgctgaaa aagggccacg gggtctacct ggacccccag accggagagg 25500 

agctcaaccc cagcttcccc caggatgccc cgaggaagca gcaagaagct gaaagtggag 25560 

ctgccgccgc cggaggattt ggaggaagac tgggagagca gtcaggcaga ggaggaggag 25620 

atggaagact gggacagcac tcaggcagag gaggacagcc tgcaagacag tctggaagac 25680 

gaggtggagg aggaggcaga ggaagaagca gccgccgcca gaccgtcgtc ctcggcggag 25740 

aaagcaagca gcacggatac catctccgct ccgggtcggg gtcgcggcga ccgggcccac 25800 

agtaggtggg acgagaccgg gcgcttcccg aaccccacca cccagaccgg taagaaggag 25860 

cggcagggat acaagtcctg gcgggggcac aaaaacgcca tcgtctcctg cttgcaagcc 25920 

tgcgggggca acatctcctt cacccgccgc tacctgctct tccaccgcgg ggtgaacttc 25980 

ccccgcaaca tcttgcatta ctaccgtcac ctccacagcc cctactactg tttccaagaa 26040 
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gaggcagaaa 


cccagcagca 


gcagaaaacc 


agcggcagca 


gcagctagaa 


aatccacagc 


26100 


ggcggcaggt 


ggactgagga 


tcgcagcgaa 


cgagccggcg 


cagacccggg 


agctgaggaa 


26160 


ccggatcttt 


cccaccctct 


atgccatctt 


ccagcagagt 


cgggggcagg 


agcaggaact 


26220 


gaaagtcaag 


aaccgttctc 


tgcgctcgct 


cacccgcagt 


tgtctgtatc 


acaagagcga 


26280 


agaccaactt 


cagcgcactc 


tcgaggacgc 


cgaggctctc 


ttcaacaagt 


actgcgcgct 


26340 


cactcttaaa 


gagtagcccg 


cgcccgccca 


cacacggaaa 


aaggcgggaa 


ttacgtcacc 


26400 


acctgcgccc 


ttcgcccgac 


catcatcatg 


agcaaagaga 


ttcccacgcc 


ttacatgtgg 


26460 


agctaccagc 


cccagatggg 


cctggccgcc 


ggcgccgccc 


aggactactc 


cacccgcatg 


26520 


aactggctca 


gcgccgggcc 


cgcgatgatc 


tcacgggtga 


atgacatccg 


cgcccgccga 


26580 


aaccagatac 


tcctagaaca 


gtcagcgatc 


accgccacgc 


cccgccatca 


ccttaatccg 


26640 


cgtaattggc 


ccgccgccct 


ggtgtaccag 


gaaattcccc 


agcccacgac 


cgtactactt 


26700 


ccgcgagacg 


cccaggccga 


agtccagctg 


actaactcag 


gtgtccagct 


ggccggcggc 


26760 


gccgccctgt 


gtcgtcaccg 


ccccgctcag 


ggtataaagc 


ggctggtgat 


ccgaggcaga 


26820 


ggcacacagc 


tcaacgacga 


ggtggtgagc 


tcttcgctgg 


gtctgcgacc 


tgacggagtc 


26880 


ttccaactcg 


ccggatcggg 


gagatcttcc 


ttcacgcctc 


gtcaggccgt 


cctgactttg 


26940 


gagagttcgt 


cctcgcagcc 


ccgctcgggt 


ggcatcggca 


ctctccagtt 


cgtggaggag 


27000 


ttcactccct 


cggtctactt 


caaccccttc 


tccggctccc 


ccggccacta 


cccggacgag 


27060 


ttcatcccga 


acttcgacgc 


catcagcgag 


tcggtggacg 


gctacgattg 


aatgtcccat 


27120 


ggtggcgcag 


ctgacctagc 


tcggcttcga 


cacctggacc 


actgccgccg 


cttccgctgc 


27180 


ttcgctcggg 


atctcgccga 


gtttgcctac 


tttgagctgc 


ccgaggagca 


ccctcagggc 


27240 


ccggcccacg 


gagtgcggat 


catcgtcgaa 


gggggcctcg 


actcccacct 


gcttcggatc 


27300 


ttcagccagc 


gaccgatcct 


ggtcgagcgc 


gagcaaggac 


agacccttct 


gaccctgtac 


27360 


tgcatctgca 


accaccccgg 


cctgcatgaa 


agtctttgtt 


gtctgctgtg 


tactgagtat 


27420 


aataaaagct 


gagatcagcg 


actactccgg 


actcgattgt 


ggtgttcctg 


ctatcaaccg 


27480 


gtccctgttc 


ttcaccggga 


acgagaccga 


gctccagctt 


cagtgtaagc 


cccacaagaa 


27540 


gtacctcacc 


tggctgttcc 


agggctcccc 


gatcgccgtt 


gtcaaccact 


gcgacaacga 


27600 


cggagtcctg 


ctgagcggcc 


ccgccaacct 


tactttttcc 


acccgcagaa 


gcaagctcca 


27660 


gctcttccaa 


cccttcctcc 


ccgggaccta 


tcagtgcgtc 


tcgggaccct 


gccatcacac 


27720 


cttccacctg 


atcccgaata 


ccacagcgcc 


gctccccgct 


actaacaacc 


aaactaccca 


27780 


ccatcgccac 


cgtcgcgacc 


tttctgaatc 


taacactacc 


acccacaccg 


gaggtgagct 


27840 


ccgaggtcga 


ccaacctctg 


ggatttacta 


cggcccctgg 


gaggtggtgg 


ggttaatagc 


27900 


gctaggccta 


gttgtgggtg 


ggcttttggc 


tctctgctac 


ctatacctcc 


cttgctgttc 


27960 


gtacttagtg 


gtgctgtgtt 


gctggtttaa 


gaaatgggga 


agatcaccct 


agtgagctgc 


28020 


ggtgcgctgg 


tggcggtggt 


ggtgttttcg 


attgtgggac 


tgggcggcgc 


ggctgtagtg 


28080 
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aaggagaagg 


ccgatccctg 


cttgcatttc 


aatcccgaca 


attgccagct 


gagttttcag 


28140 


cccgatggca 


atcggtgcgc 


ggtgctgatc 


aagtgcggat 


gggaatgcga 


gaacgtgaga 


28200 


atcgagtaca 


ataacaagac 


tcggaacaat 


actctcgcgt 


ccgtgtggca 


gcccggggac 


28260 


cccgagtggt 


acaccgtctc 


tgtccccggt 


gctgacggct 


ccccgcgcac 


cgtgaacaat 


28320 


actttcattt 


ttgcgcacat 


gtgcgacacg 


gtcatgtgga 


tgagcaagca 


gtacgatatg 


28380 


tggcccccca 


cgaaggagaa 


catcgtggtc 


ttctccatcg 


cttacagcgc 


gtgcacggcg 


28440 


ctaatcaccg 


ctatcgtgtg 


cctgagcatt 


cacatgctca 


tcgctattcg 


ccccagaaat 


28500 


aatgccgaaa 


aagagaaaca 


gccataacac 


gttttttcac 


acaccttttt 


cagaccatgg 


28560 


cctctgttaa 


atttttgctt 


ttatttgcca 


gtctcattac 


tgttataagt 


aatgagaaac 


28620 


tcactattta 


cattggcact 


aaccacactt 


tagacggaat 


tccaaaatcc 


tcatggtatt 


28680 


gctattttga 


tcaagatcca 


gacttaacta 


tagaactgtg 


tggtaacaag 


ggaaaaaata 


28740 


caagcattca 


tttaattaac 


tttaattgcg 


gagacaattt 


gaaattaatt 


aatatcacta 


28800 


aagagtatgg 


aggtatgtat 


tactatgttg 


cagaaaataa 


caacatgcag 


ttttatgaag 


28860 


ttactgtaac 


taatcccacc 


acacctagaa 


caacaacaac 


caccaccaca 


aaaactacac 


28920 


ctgttaccac 


tatgcagctc 


actaccaata 


acatttttgc 


catgcgtcaa 


atggtcaaca 


28980 


atagcactca 


acccacccca 


cccagtgagg 


aaattcccaa 


atccatgatt 


ggcattattg 


29040 


ttgctgtagt 


ggtgtgcatg 


ttgatcatcg 


ccttgtgcat 


ggtgtactat 


gccttctgct 


29100 


acagaaagca 


cagactgaac 


gacaagctgg 


aacacttact 


aagtgttgaa 


ttttaatttt 


29160 


ttagaaccat 


gaagatccta 


ggccttttaa 


ttttttctat 


cattacctct 


gctctatgca 


29220 


attctgacaa 


tgaggacgtt 


actgtcgttg 


tcggaaccaa 


ttatacactg 


aaaggtccag 


29280 


cgaagggtat 


gctttcgtgg 


tattgctggt 


ttggaactga 


cgagcaacag 


acagagctct 


29340 


gcaatgctca 


aaaaggcaaa 


acctcaaatt 


ctaaaatctc 


taattatcaa 


tgcaatggca 


29400 


ctgacttagt 


actgctcaat 


gtcacgaaag 


catatgctgg 


cagctacacc 


tgccctggag 


29460 


atgatactga 


gaacatgatt 


ttttacaaag 


tggaagtggt 


tgatcccact 


actccacctc 


29520 


cacccaccac 


aactactcac 


accacacaca 


cagaacaaac 


cacagcagag 


gaggcagcaa 


29580 


agttagcctt 


gcaggtccaa 


gacagttcat 


ttgttggcat 


tacccctaca 


cctgatcagc 


29640 


ggtgtccggg 


gctgctcgtc 


agcggcattg 


tcggtgtgct 


ttcgggatta 


gcagtcataa 


29700 


tcatctgcat 


gttcattttt 


gcttgctgct 


atagaaggct 


ttaccgacaa 


aaatcagacc 


29760 


cactgctgaa 


cctctatgtt 


taattttttc 


cagagccatg 


aaggcagtta 


gcactctagt 


29820 


tttttgttct 


ttgattggca 


ctgtttttag 


tgttagcttt 


ttgaaacaaa 


tcaatgttac 


29880 


tgagggggaa 


aatgtgacac 


tggtaggcgt 


agagggtgct 


caaaatacca 


cctggacaaa 


29940 


attccatcta 


gatgggtgga 


aagaaatttg 


cacctggaat 


gtcagtactt 


atacatgtga 


30000 


«yy<*y i Laa l 


r 1" 1" a r* r a 1" +* n 


traatntran 


rraaattraa 


aannnttnna 
ciciyy y l lyya 


a a n fin ra 


30060 


atctgttagt 


gttagcaata 


gtgggtacta 


tacccagcat 


actcttatct 


atgacattat 


30120 


agttatacca 


ctgcctacac 


ctagcccacc 


tagcactacc 


acacagacaa 


cccacactac 


30180 
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acaaacaacc 


acatacagta 


catcaaatca 


gcctaccacc 


actacaacag 


cagaggttgc 


30240 


cagctcgtct 


ggggtccgag 


tggcattttt 


gatgttggcc 


ccatctagca 


gtcccactgc 


30300 


tagtaccaat 


gagcagacta 


ctgaattttt 


gtccactgtc 


gagagccaca 


ccacagctac 


30360 


ctcgagtgcc 


ttctctagca 


ccgccaatct 


atcctcgctt 


tcctctacac 


caatcagtcc 


30420 


cgctactact 


cctacccccg 


ctattctccc 


cactcccctg 


aagcaaacag 


acggcgacat 


30480 


gcaatggcag 


atcaccctgc 


tcattgtgat 


cgggttggtc 


atcctggccg 


tgttgctcta 


30540 


ctacatcttc 


tgccgccgca 


ttcccaacgc 


gcaccgcaag 


ccggcctaca 


agcccatcgt 


30600 


tgtcgggcag 


ccggagccgc 


ttcaggtgga 


agggggtcta 


aggaatcttc 


tcttctcttt 


30660 


tacagtatgg 


tgattgaatt 


atgattccta 


gacaaatctt 


gatcactatt 


cttatctgcc 


30720 


tcctccaagt 


ctgtgccacc 


ctcgctctgg 


tggccaacgc 


cagtccagac 


tgtattgggc 


30780 


ccttcgcctc 


ctacgtgctc 


tttgccttca 


tcacctgcat 


ctgctgctgt 


agcatagtct 


30840 


gcctgcttat 


caccttcttc 


cagttcattg 


actggatctt 


tgtgcgcatc 


gcctacctgc 


30900 


gccaccaccc 


ccagtaccgc 


gaccagcgag 


tggcgcggct 


gctcaggatc 


ctctgataag 


30960 


catgcgggct 


ctgctacttc 


tcgcgcttct 


gctgttagtg 


ctcccccgtc 


ccgtcgaccc 


31020 


ccggaccccc 


acccagtccc 


ccgaggaggt 


ccgcaaatgc 


aaattccaag 


aaccctggaa 


31080 


attcctcaaa 


tgctaccgcc 


aaaaatcaga 


catgcatccc 


agctggatca 


tgatcattgg 


31140 


gatcgtgaac 


attctggcct 


gcaccctcat 


ctcctttgtg 


atttacccct 


gctttgactt 


31200 


tggttggaac 


tcgccagagg 


cgctctatct 


cccgcctgaa 


cctgacacac 


caccacagca 


31260 


acctcaggca 


cacgcactac 


caccaccacc 


acagcctagg 


ccacaataca 


tgcccatatt 


31320 


agactatgag 


gccgagccac 


agcgacccat 


gctccccgct 


attagttact 


tcaatctaac 


31380 


cggcggagat 


gactgaccca 


ctggccaaca 


acaacgtcaa 


cgaccttctc 


ctggacatgg 


31440 


acggccgcgc 


ctcggagcag 


cgactcgccc 


aacttcgcat 


tcgccagcag 


caggagagag 


31500 


ccgtcaagga 


gctgcaggac 


ggcatagcca 


tccaccagtg 


caagaaaggc 


atcttctgcc 


31560 


tggtgaaaca 


ggccaagatc 


tcctacgagg 


tcacccagac 


cgaccatcgc 


ctctcctacg 


31620 


agctcctgca 


gcagcgccag 


aagttcacct 


gcctggtcgg 


agtcaacccc 


atcgtcatca 


31680 


cccagcagtc 


gggcgatacc 


aaggggtgca 


tccactgctc 


ctgcgactcc 


cccgactgcg 


31740 


tccacactct 


gatcaagacc 


ctctgcggcc 


tccgcgacct 


cctccccatg 


aactaatcac 


31800 


ccccttatcc 


agtgaaataa 


agatcatatt 


gatgatttga 


gtttaataaa 


aataaagaat 


31860 


cacttacttg 


aaatctgata 


ccaggtctct 


gtccatgttt 


tctgccaaca 


ccacttcact 


31920 


cccctcttcc 


cagctctggt 


actgcaggcc 


ccggcgggct 


gcaaacttcc 


tccacaccct 


31980 


gaaggggatg 


tcaaattcct 


cctgtccctc 


aatcttcatt 


ttatcttcta 


tcagatgtcc 


32040 


aaaaagcgcg 


tccgggtgga 


tgatgacttc 


gaccccgtct 


acccctacga 


tgcagacaac 


32100 


gcaccgaccg 


tgcccttcat 


caaccccccc 


ttcgtctctt 


cagatggatt 


ccaagagaag 


32160 


cccctggggg 


tgctgtccct 


gcgtctggcc 


gatcccgtca 


ccaccaagaa 


cggggaaatc 


32220 
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accctcaaac 


taaaaaataa 
L yyy a y a L yy 


ggtggacctc 


aactcctcqq 


gaaaactcat 


ctccaacacg 


32280 


gccaccaagg 


ccgccgcccc 


tctcagtttt 


tccaacaaca 


ccatttccct 


taacatggat 


32340 


accccttttt 


acaacaacaa 


tgqaaagtta 


ggcatgaaag 


tcactgctcc 


actgaagata 


32400 


ctagacacag 


acttgctaaa 


aacacttgtt 


gtagcttatg 


gacaaggttt 

3 33 


aggaacaaac 


32460 


accactqqtq 


cccttgttgc 


ccaactagca 


tccccacttg 


cttttgatag 


caatagcaaa 


32520 


attgccctta 


atttaggcaa 


tggaccattg 


aaaqtqqatq 

3 w 33 w w 3 


caaatagact 


gaacatcaat 


32580 


tqcaataqaq 


gactctatgt 


tactaccaca 


aaagatgcac 


tggaagccaa 

~3 3~ — 3 — — 


tataagttgg 


32640 


gctaatgcta 


tgacatttat 


aggaaatgcc 


atgggtgtca 


atattgatac 


acaaaaaggc 


32700 


ttgcaatttg 


gcaccactag 


taccgtcgca 


gatgttaaaa 


acgcttaccc 


catacaaatc 


32760 


aaacttggag 


ctggtctcac 


atttgacagc 


acaggtgcaa 


ttgttgcatg 


gaacaaagat 


32820 


gatgacaagc 


ttacactatg 


gaccacagcc 


gacccctctc 


caaattgtca 


catatattct 


32880 


gaaaaggatg 


ctaagcttac 


actttgcttg 


acaaagtgtg 


qcaqtcaqat 


tctgggcact 


32940 


gtttccctca 


tagctgttga 


tactggcagt 


ttaaatccca 


taacaggaac 


agtaaccact 


33000 


gctcttgtct 


cacttaaatt 


cgatgcaaat 


qgagttttgc 

33 3 w w w 3 


aaagcagctc 


aacactagac 


33060 


tcagactatt 


ggaatttcag 


acagggagat 


gttacacctg 


ctgaagccta 


tactaatgct 


33120 


ataggtttca 


tgcccaatct 


aaaagcatac 


cctaaaaaca 


caagtggagc 


tgcaaaaagt 


33180 


cacattgttg 


ggaaagtgta 


cctacatggg 


gatacaggca 


aaccactgga 


cctcattatt 


33240 


actttcaatg 


aaacaagtga 


tgaatcttgc 


acttactgta 


ttaactttca 


atqqcagtqq 

h 33 3 h 33 


33300 


qqqqctqatc 

3 33 3*" **3** 


aatataaaaa 


tgaaacactt 


gccgtcagtt 


cattcacctt 


ttcctatatt 


33360 


gctaaagaat 


aaaccccact 


ctgtacccca 


tctctgtcta 


tggaaaaaac 


tctgaaacac 


33420 


aaaataaaat 


aaagttcaag 


tgttttattg 


attcaacagt 


tttacaggat 


tcgagcagtt 


33480 


atttttcctc 


caccctccca 


qqacatgqaa 


tacaccaccc 


tctccccccg 


cacagccttg 


33540 


aacatctgaa 


tgccattggt 


qatqqacatq 

3 33 w 3 


cttttggtct 


ccacgttcca 


cacagtttca 


33600 


qaqcqaqcca 


gtctcgggtc 


qqtcaqqqaq 

33 wvw 333 u 3 


atgaaaccct 


ccgggcactc 


ccgcatctgc 


33660 


acctcacagc 


tcaacagctg 


aqqattqtcc 


tcqgtggtcq 

33 33 *" *~3 


qqatcacqqt 


tatctggaag 


33720 


aagcagaaga 


qcqqcqqtqq 


gaatcatagt 


ccgcgaacgg 


qatcqqccqq 

3 wv 33 w **33 


tqgtqtcgca 

33 *"3 *■ ^"3 


33780 


tcaggccccg 


cagcagtcgc 


tgtcgccgcc 


gctccgtcaa 


gctgctgctc 


aqqqqqtccq 

**33333 3 


33840 


ggtccaqqqa 


ctccctcagc 


atgatgccca 


cggccctcag 


catcagtcgt 


ctgqtqcggc 

**3 3 3 3 3** 


33900 


gggcgcagca 


gcgcatgcgg 

3 ^"3 3 3 3 


atctcgctca 


ggtcgctgca 

3 3 3 3 


gtacgtgcaa 


cacaggacca 


33960 


ccaggttgtt 


caacagtcca 


tagttcaaca 


cgctccagcc 


gaaactcatc 


qcqqqaaqqa 


34020 


tgctacccac 


gtggccgtcg 

3 33 3 *"~3 


taccagatcc 


tcaqqtaaat 


caagtqqcqc 

3 w 3 3 3 ^* 


cccctccaga 


34080 


acacgctgcc 


catgtacatg 


atctccttgg 


qcatqtqqcq 


gttcaccacc 


tcccggtacc 


34140 


acatcaccct 


ctggttgaac 


atgcagcccc 


ggatgatcct 


gcggaaccac 


agggccagca 


34200 


ccgccccgcc 


cgccatgcag 


cgaagagacc 


ccgggtcccg 


acaatggcaa 


tggaggaccc 


34260 


accgctcgta 


cccgtggatc 


atctgggagc 


tgaacaagtc tatgttggca 
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atatgctcat 


gcatctcttc 


agcactctca 


gctcctcggg 


ggtcaaaacc 


atatcccagg 


34380 


gcacggggaa 


ctcttgcagg 


acagcgaacc 


ccgcagaaca 


gggcaatcct 


cgcacataac 


34440 


ttacattgtg 


catggacagg 


gtatcgcaat 


caggcagcac 


cgggtgatcc 


tccaccagag 


34500 


aagcgcgggt 


ctcggtctcc 


tcacagcgtg 


gtaagggggc 


cggccgatac 


gggtgatggc 


34560 


gggacgcggc 


tgatcgtgtt 


cgcgaccgtg 


ttatgatgca 


gttgctttcg 


gacattttcg 


34620 


tacttgctgt 


agcagaacct 


ggtccgggcg 


ctgcacaccg 


atcgccggcg 


gcggtcccgg 


34680 


cgcttggaac 


gctcggtgtt 


gaagttgtaa 


aacagccact 


ctctcagacc 


gtgcagcaga 


34740 


tctagggcct 


caggagtgat 


gaagatccca 


tcatgcctga 


tggctctaat 


cacatcgacc 


34800 


accgtggaat 


gggccagacc 


cagccagatg 


atgcaatttt 


gttgggtttc 


ggtgacggcg 


34860 


ggggagggaa 


gaacaggaag 


aaccatgatt 


aacttttaat 


ccaaacggtc 


tcggagcact 


34920 


tcaaaatgaa 


gatcgcggag 


atggcacctc 


tcgcccccgc 


tgtgttggtg 


gaaaataaca 


34980 


gccaggtcaa 


aggtgatacg 


gttctcgaga 


tgttccacgg 


tggcttccag 


caaagcctcc 


35040 


acgcgcacat 


ccagaaacaa 


gacaatagcg 


aaagcgggag 


ggttctctaa 


ttcctcaatc 


35100 


atcatgttac 


actcctgcac 


catccccaga 


taattttcat 


ttttccagcc 


ttgaatgatt 


35160 


cgaactagtt 


cctgaggtaa 


atccaagcca 


gccatgataa 


agagctcgcg 


cagagcgccc 


35220 


tccaccggca 


ttcttaagca 


caccctcata 


attccaagat 


attctgctcc 


tggttcacct 


35280 


gcagcagatt 


gacaagcgga 


atatcaaaat 


ctctgccgcg 


atccctaagc 


tcctccctca 


35340 


gcaataactg 


taagtactct 


ttcatatcct 


ctccgaaatt 


tttagccata 


ggaccaccag 


35400 


gaataagatt 


agggcaagcc 


acagtacaga 


taaaccgaag 


tcctccccag 


tgagcattgc 


35460 


caaatgcaag 


actgctataa 


gcatgctggc 


tagacccggt 


gatatcttcc 


agataactgg 


35520 


acagaaaatc 


gcccaggcaa 


tttttaagaa 


aatcaacaaa 


agaaaaatcc 


tccaggtgca 


35580 


cgtttagagc 


ctcgggaaca 


acgatggagt 


aaatgcaagc 


ggtgcgttcc 


agcatggtta 


35640 


gttagctgat 


ctgtagaaaa 


aaacaaaaat 


gaacattaaa 


ccatgctagc 


ctggcgaaca 


35700 


ggtgggtaaa 


tcgttctctc 


cagcaccagg 


caggccacgg 


ggtctccggc 


acgaccctcg 


35760 


taaaaattgt 


cgctatgatt 


gaaaaccatc 


acagagagac 


gttcccggtg 


gccggcgtga 


35820 


atgattcgac 


aagatgaata 


cacccccgga 


acattggcgt 


ccgcgagtga 


aaaaaagcgc 


35880 


ccaaggaagc 


aataaggcac 


tacaatgctc 


agtctcaagt 


ccagcaaagc 


gatgccatgc 


35940 


ggatgaagca 


caaaattctc 


aggtgcgtac 


aaaatgtaat 


tactcccctc 


ctgcacaggc 


36000 


agcaaagccc 


ccgatccctc 


caggtacaca 


tacaaagcct 


cagcgtccat 


agcttaccga 


36060 


gcagcagcac 


acaacaggcg 


caagagtcag 


agaaaggctg 


agctctaacc 


tgtccacccg 


36120 


ctctctgctc 


aatatatagc 


ccagatctac 


actgacgtaa 


aggccaaagt 


ctaaaaatac 


36180 


ccgccaaata 


atcacacacg 


cccagcacac 


gcccagaaac 


cggtgacaca 


ctcaaaaaaa 


36240 


tacgcgcact 


tcctcaaacg 


cccaaactgc 


cgtcatttcc 


gggttcccac 


gctacgtcat 


36300 


caaaattcga 


ctttcaaatt 


ccgtcgaccg 


ttaaaaacgt 


cgcccgcccc 


gcccctaacg 


36360 
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gtcgccgctc ccgcagccaa tcaccgcccc gcatccccaa attcaaatac ctcatttgca 36420 
tattaacgcg caccaaaagt ttgaggtata ttattgatga tg 36462 

<210> 2 

<211> 36604 

<212> DNA 

<213> chimpanzee adenovirus serotype Pan6 



<400> 2 
catcatcaat 

U.U U U> U L. b< U U U 


aatatacctc 


aaacttttaa 

bvuub. b v* w b«jy 


tacacattaa 

ty v»y b bbibv 


tatacaaatQ aactatttaa 


60 


atttaaaaaa 


aaaaaaaaat 


aattaactac 

y a u tyy*. ^y*« 


aaaaacaaca 


accattaaaa acaaaQcaaa 

«v,u,y u u«yyy y^-yyyy^yyy 


120 


1 - n a c n t 1" tt a 

LyaLy ill uy 


a Lyacy uyyu. 


tataaaacaa 


aaccaattta 
y yy *■ *» ^y 


caaattctca taaaaaaaat 


180 


a a catenate 


nanntntant 
yayy uy Lyy l 


ttaaararaa 

u uy uuuauyy 


aaatactcaa 

uau uuu. u van 


ttttcccaca ctctctaaca 


240 


yy<*»<* *-y & yy 


tatttrtaan 

ty i- L.U. Lyyy 


COaatacaaG 
L>yya uyu.ci.cxy 


taaaaa cqqq 

uyc4.ciciuu.yyy 


ccattttcac acaaaaacta 

bib>u b> b b. bvuv. y v>yuuuu\» y 


300 




taaaaatrta 

L.y CXCLCLU L.y 


aat aattt cg 

ay uaa u u uu.y 


catttataac 
»— y i— l yy 


aaaaaaaaat atttaccaaa 
yy y yy y **u uuyb*v»y«.y 


360 


y y uuy ciy my 


artttaarca 

aU. L, u uy av.L.y 


attacat aaa 

a u u clu. y uy y y 


aatttcaatt 

yy u u LL.yo u u 


accatatttt tcacctaaat 

b\bjL>y bU b b b b b\>UV>\> bUUU b 


420 


ttecacatac 

U LL.L.yL.y LUL 


aatatcaaaa 


tccaatattt 

u u u y y uy u u u 


ttacataaoc 


atcaactaat caccaaaata 


480 


tttaaaccta 

u u c auaVa v. v. y 


cactctctaa 

L»y »- b— ^- v. luh 


tcaaaaaacc 


actcttaaat 


gecagegagt agagttttct 


540 


CCtCCQCQCC 


acaaatcaaa 


tctacacttt 


aaaaaataaa 


gcacctgaga gacctgcccg 


600 


ataatatttt 

y i_cm c y u u u u 


cctaactact 

V— tyy v. b. ul. l 


aaaaacaaaa 
yy y y y 


ttctaaaatt 


aataataaac accataataq 


660 


ataacaaccc 

y uy u\.yu.«- l»u» 


tccaaaaccc 

vuy uy v_ v_ \_ 


cctaccccat 

b. b. b la. v» \«vvu b 


ttaaaacacc 

v. wy t^y y ^y 


ttcactatac aatttatata 

w *• Z3 ****** **** *■ w w *j w** w 


720 


atctaaaaat 


aaatataccc 
yy y uyv«\«v 


aaaaacaacc 

y u M u y vyuw 


ctaacaaaaa 


aacaataaat aatttattta 

yy yy *»y**** y ~ ^y b** 


780 


y wyu kyv>^>yv 


actactaact 

y *-y v.uyyv« v» 


accaaacaaa 

y»»»y tt y» tt yy 


ctaatacaaa 


ctctaactca aacaacaatt 


840 


cctctctcca 

%• b> w L» I. V. U L>LibL 


tacrrraaaa 

t- u v. \— v. L»y ^*y *•* 


yy y y 


otaaaaaaaa 

y uy uyuuuuu 


aatccccaaa cttaaaaaaa 

yub^bbbyuy b> b buuuyyyy 


900 


aaaaactcaa 

uuyuy v v wyu 


crtacactac 

b» ky ~y *» y 


tataaaaaat 

l. u uy *-*y y **** *- 


acttacctcc 


aaacaataat aaaaaaaaca 

y y y y y yy yy y 


960 


aaaaaacaat 
^yydyy^-yct l. 


tcaaactaca 

Lv.y uy v. Ly Ly 


ataaaccaaa 


aaataaaaac 

y c*y tyuuuuv. 


tacaaacaaa aactttaacc 

by vyyy v«y wy y b**y 


1020 


taaactatcc 
99 y uu>u» 


tactctaccc 

Uau, U L. LyULU 


aaacacaact 

y yauouy y v» u 


ataaatctta 

y ufcAbLy l w %m by 


taaatttcat cacataaata 

W \j ** W* ^ *» W *m 1* W ** *j **** **^4 *A*A ** W* 


1080 


ctaaaaataa 

Lyyuy a uaa 


aaatataata 

ynuLy Lyn c y 


tataccctat 

uy uy u y u 


actatataaa 

y bu lu by uy 


aacttacaac cattatattt 

MM w b bMVUUV Vl* b by b y b b b 


1140 


acaataaata 

uv.ciy Laay u y 


taattaactt 

uy a u u ci d v_ u u 


taattaaaaa 

uay u uy yycxa 


aacaaaaaat 
yy v * a y a yyy ^ 


aactaaatac taactaattt 

y uv wyy y uybj byub y y 


1200 


atttatatat 

a. u u ua uy ua u 


atattttttt 


atatataaat 

auy uy uayy u 


cccatctcta 

u>v>u«y uu> tbiky 


acataaataa aacccccact 

uwy bviyubyu y u\>\>b<bavu\> b 


1260 


teagagtgea 


tttcatcacc 


cccagaaatt 


ggegaggaac 


cgcccgaaga tattattcat 


1320 


agaccagttg 


cagtgagagt 


caccgggcgg 


agagcagctg 


tggagagttt ggatgacttg 


1380 


ctacagggtg 


gggatgaacc 


tttggacttg 


tgtacccgga 


aacgccccag gcactaagtg 


1440 


ccacacatgt 


gtgtttactt 


aaggtgatgt 


cagtatttat 


agggtgtgga gtgcaataaa 


1500 


atccgtgttg 


actttaagtg 


cgtgttttat 


gactcagggg 


tggggactgt gggtatataa 


1560 


gcaggtgcag 


acctgtgtgg 


tcagttcaga 


gcaggactca 


tggagatctg gactgtcttg 


1620 


gaagactttc 


accagactag 


acagttgeta 


gagaactcat 


eggagggagt ctcttacctg 


1680 


tggagattct 


gcttcggtgg 


gcctctagct 


aagctagtct 


atagggecaa acaggattat 


1740 



Page 19 



WO 2005/001103 



PCT/US2004/016614 



aaggaacaat 


ttgaggatat 


tttgagagag 


tgtcctggta 


tttttgactc 


tctcaacttg 


1800 


ggccatcagt 


ctcactttaa 


ccagagtatt 


ctgagagccc 


ttgacttttc 


tactcctggc 


1860 


agaactaccg 


ccgcggtagc 


ettttttgee 


tttattcttg 


acaaatggag 


tcaagaaacc 


1920 


catttcagca 


gggattaccg 


tctggactgc 


ttagcagtag 


ctttgtggag 


aacatggagg 


1980 


tgccagcgcc 


tgaatgcaat 


ctccggctac 


ttgecagtae 


ageeggtaga 


caegctgagg 


2040 


atcctgagtc 


tccagtcacc 


ccaggaacac 


caacgccgcc 


agcagccgca 


gcaggagcag 


2100 


cagcaagagg 


aggaccgaga 


agagaacccg 


agagceggtc 


tggaccctcc 


ggtggcggag 


2160 


gaggaggagt 


agctgacttg 


tttcccgagc 


tgcgccgggt 


gctgactagg 


tcttccagtg 


2220 


gacgggagag 


ggggattaag 


egggagagge 


atgaggagac 


tagecacaga 


actgaactga 


2280 


ctgtcagtct 


gatgagcege 


aggcgcccag 


aatcggtgtg 


gtggcatgag 


gtgeagtege 


2340 


aggggataga 


tgaggtctcg 


gtgatgcatg 


agaaatattc 


cctagaacaa 


gtcaagactt 


2400 


gttggttgga 


geccgaggat 


gattgggagg 


tagecatcag 


gaattatgee 


aagctggctc 


2460 


tgaagccaga 


caagaagtac 


aagattacca 


aactgattaa 


tatcagaaat 


tcctgctaca 


2520 


tttcagggaa 


tggggecgag 


gtggagatca 


gtacccagga 


gagggtggcc 


ttcagatgtt 


2580 


gtatgatgaa 


tatgtacccg 


ggggtggtgg 


gcatggaggg 


agtcaccttt 


atgaacacga 


2640 


ggttcagggg 


tgatgggtat 


aatggggtgg 


tctttatggc 


caacaccaag 


ctgacagtgc 


2700 


acggatgctc 


cttctttggc 


ttcaataaca 


tgtgcatcga 


ggcctggggc 

99 9 9 7 9 


agtgtttcag 


2760 


tgaggggatg 


cagcttttca 


gccaactgga 


tgggggtcgt 


gggcagaacc 


aagagcaagg 


2820 


tgtcagtgaa 


gaaatgcctg 


ttcgagaggt 


gccacctggg 


ggtgatgagc 


gagggegaag 

9 9 9 9 9 9 


2880 


ccaaagtcaa 


acactgcgcc 


tctaccgaga 


egggctgett 


tgtgctgatc 


aagggcaatg 


2940 


cccaagtcaa 


gcataacatg 


atctgtgggg 


ecteggatga 


gcgcggctac 


cagatgetga 


3000 


cctgcgccgg 


tgggaacagc 


catatgetgg 


ccaccgtgca 


tgtggcctcg 


cacccccgca 


3060 


agacatggcc 


cgagttcgag 


cacaacgtca 


tgacccgctg 


caatgtgcac 


ctgggctccc 


3120 


gccgaggcat 


gttcatgccc 


taccagtgea 


acatgeaatt 


tgtgaaggtg 


ctgctggagc 


3180 


ccgatgccat 


gtccagagtg 


agectgaegg 


gggtgtttga 


catgaatgtg 


gagctgtgga 


3240 


aaattctgag 


atatgatgaa 


tccaagacca 


ggtgccgggc 


ctgcgaatgc 


ggaggcaagc 

99 99 9 


3300 


acgccaggct 


tcagcccgtg 


tgtgtggagg 


tgacggagga 


cctgcgaccc 


gatcatttgg 


3360 


tgttgtcctg 


caaegggacg 


gagttegget 


ecagegggga 


agaatctgac 


tagagtgagt 


3420 


agtgtttggg 


gctgggtgtg 


agectgeatg 


aggggcagaa 


tgactaaaat 


ctgtggtttt 

9 9 9 


3480 


ctgtgtgttg 


cagcagcatg 


ageggaageg 


cctcctttga 


gggaggggta 

999 9999 


ttcagccctt 


3540 


atctgacggg 


gcgtctcccc 


tectgggegg 


gagtgegtea 


gaatgtgatg 


ggatccaegg 


3600 


tggacggccg 


gcccgtgcag 


cccgcgaact 


cttcaaccct 


gacctacgcg 


accctgagct 


3660 










cgccgccagc 


gecg xgcgcg 


5/ cA) 


gaatggccct 


gggcgccggc 


tactacagct 


ctctggtggc 


caactcgagt 


tccaccaata 


3780 


atcccgccag 


cctgaacgag 


gagaagctgc 


tgetgetgat 


ggcccagctc 


gaggecctga 


3840 
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cccagcgcct 


gggcgagctg 


acccagcagg 


tggctcagct 


gcaggcggag 


acgcgggccg 


3900 


cggttgccac 


ggtgaaaacc 


aaataaaaaa 


tgaatcaata 


aataaacgga 


gacggttgtt 


3960 


gattttaaca 


cagagtcttg 


aatctttatt 


tgatttttcg 


cgcgcggtag 


gccctggacc 


4020 


accggtctcg 


atcattgagc 


acccggtgga 


tcttttccag 


gacccggtag 


aggtgggctt 


4080 


ggatgttgag 


gtacatgggc 


atgagcccgt 


cccgggggtg 


gaggtagctc 


cattgcaggg 


4140 


cctcgtgctc 


ggggatggtg 


ttgtaaatca 


cccagtcata 


gcaggggcgc 


agggcgtggt 


4200 


gctgcacgat 


gtccttgagg 


aggagactga 


tggccacggg 


cagccccttg 


gtgtaggtgt 


4260 


tgacgaacct 


gttgagctgg 


gagggatgca 


tgcgggggga 


gatgagatgc 


atcttggcct 


4320 


ggatcttgag 


attggcgatg 


ttcccgccca 


gatcccgccg 


ggggttcatg 


ttgtgcagga 


4380 


ccaccagcac 


ggtgtatccg 


gtgcacttgg 


ggaatttgtc 


atgcaacttg 


gaagggaagg 


4440 


cgtgaaagaa 


tttggagacg 


cccttgtgac 


cgcccaggtt 


ttccatgcac 


tcatccatga 


4500 


tgatggcgat 


gggcccgtgg 


gcggcggcct 


gggcaaagac 


gtttcggggg 


tcggacacat 


4560 


cgtagttgtg 


gtcctgggtg 


agctcgtcat 


aggccatttt 


aatgaatttg 


gggcggaggg 


4620 


tgcccgactg 


ggggacgaag 


gtgccctcga 


tcccgggggc 


gtagttgccc 


tcgcagatct 


4680 


gcatctccca 


ggccttgagc 


tcggaggggg 


ggatcatgtc 


cacctgcggg 


gcgatgaaaa 


4740 


aaacggtttc 


cggggcgggg 


gagatgagct 


gggccgaaag 


caggttccgg 


agcagctggg 


4800 


acttgccgca 


accggtgggg 


ccgtagatga 


ccccgatgac 


cggctgcagg 


tggtagttga 


4860 


gggagagaca 


gctgccgtcc 


tcgcggagga 


ggggggccac 


ctcgttcatc 


atctcgcgca 


4920 


catgcatgtt 


ctcgcgcacg 


agttccgcca 


ggaggcgctc 


gccccccagc 


gagaggagct 


4980 


cttgcagcga 


ggcgaagttt 


ttcagcggct 


tgagtccgtc 


ggccatgggc 


attttggaga 


5040 


gggtctgttg 


caagagttcc 


agacggtccc 


agagctcggt 


gatgtgctct 


agggcatctc 


5100 


gatccagcag 


acctcctcgt 


ttcgcgggtt 


ggggcgactg 


cgggagtagg 


gcaccaggcg 


5160 


atgggcgtcc 


agcgaggcca 


gggtccggtc 


cttccagggc 


cgcagggtcc 


gcgtcagcgt 


5220 


ggtctccgtc 


acggtgaagg 


ggtgcgcgcc 


gggctgggcg 


cttgcgaggg 


tgcgcttcag 


5280 


gctcatccgg 


ctggtcgaga 


accgctcccg 


gtcggcgccc 


tgcgcgtcgg 


ccaggtagca 


5340 


attgagcatg 


agttcgtagt 


tgagcgcctc 


ggccgcgtgg 


cccttggcgc 


ggagcttacc 


5400 


tttggaagtg 


tgtccgcaga 


cgggacagag 


gagggacttg 


agggcgtaga 


gcttgggggc 


5460 


gaggaagacg 


gactcggggg 


cgtaggcgtc 


cgcgccgcag 


ctggcgcaga 


cggtctcgca 


5520 


ctccacgagc 


caggtgaggt 


cggggcggtt 


ggggtcaaaa 


acgaggtttc 


ctccgtgctt 


5580 


tttgatgcgt 


ttcttacctc 


tggtctccat 


gagctcgtgt 


ccccgctggg 


tgacaaagag 


5640 


gctgtccgtg 


tccccgtaga 


ccgactttat 


gggccggtcc 


tcgagcgggg 


tgccgcggtc 


5700 


ctcgtcgtag 


aggaaccccg 


cccactccga 


gacgaaggcc 


cgggtccagg 


ccagcacgaa 


5760 


ggaggccacg 


tgggaggggt 


agcggtcgtt 


gtccaccagc 


gggtccacct 


tctccagggt 


5820 


atgcaagcac 


atgtccccct 


cgtccacatc 


caggaaggtg 


attggcttgt 


aagtgtaggc 


5880 
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cacgtgaccg 


ggggtcccgg 

3333 *• ~ — ~ ;y g 


ccgggggggt 

3333333 


ataaaagggg 


gcgggcccct 


gctcgtcctc 


5940 


actgtcttcc 


ggatcgctgt 


ccaggagcqc 


cagctgttgg 

3 3 ""33 


ggtaggtatt 

33 33 


ccctctcgaa 


6000 


qqcqqqcatq 


acctcggcac 


tcaggttgtc 


agtttctaga 


aacgaggagg 

3 33 33 


atttgatatt 


6060 


qacqqtqccq 


ttggagacgc 


ctttcatgag 


cccctcgtcc 


atttggtcag 


aaaagacgat 


6120 


ctttttgttg 


tcgagcttgg 


tggcgaagga 


gccgtagagg 

3 3 ""3~33 


gcgttggaga 


gcagcttggc 

3 3 3 3 


6180 


gatggagcgc 

3 3 3 3 3 


atggtctggt 

33 33 


tcttttcctt 


gtcggcgcgc 

3 33 3 3 


tccttggcgg 


cgatgttgag 


6240 


ctgcacgtac 


tcgcgcgcca 


cgcacttcca 


ttcggggaag 

ZJ ^ ^ Zt 3 


acggtggtga 


gctcgtcggg 


6300 


cacgattctg 


acccgccagc 


cgcggttgtg 


cagggtgatg 


aggtccacgc 


tggtggccac 


6360 


ctcgccgcgc 


aggggctcgt 


tggtccagca 


gaggcgcccg 


cccttgcgcg 


agcagaaggg 


6420 


qqqcagcqqg 

333 3 333 


tccagcatga 


gctcgtcggg 

3 3 ™ 3 3 3 


ggggtcggcg 

3 3 3 3 333 


tccacggtga 


agatgccggg 


6480 


caqqagctcq 

*~3 3 3 ^" ***-3 


gggtcgaagt 


agctgatgca 


ggtgcccaga 


ttgtccagcg 


ccgcttgcca 


6540 


gtcgcgcacg 


gccagcgcgc 

ZJ ZJ ZJ ZJ 


gctcgtaggg 

3^* ~3 3 33 


qctgaqqqqc 


gtgccccagg 

3 3 33 


qcatqggqtq 

3 ""3 33 3 ~3 


6600 


cgtgagcgcg 


gaggcgtaca 


tgccgcagat 

3 3 3 


gtcgtagacg 


tagaggggct 

3 ZJ ZJ ZJ ZJ 


cctcgaggac 


6660 


gccgatgtag 

3 3 3 3 


gtggggtagc 

J ZJ ZJ Z3 ZJ 3 


agcgcccccc 

3 3 


gcggatgctg 

3 33 3 3 


gcgcgcacgt 

3 3 3 3 


agtcgtacag 


6720 


ctcgtgcgag 

3 3 3 3 


ggcgcgagga 

33 , «»3 , " , '3— »33 


gccccgtgcc 


gaggttggag 

3 3 3 3 3 3 


cgttgcggct 

3 3 3 3 


tttcggcgcg 

3 3 3 3 


6780 


gtagacgatc 


tggcggaaga 

3 3 33 3 


tggcgtggga 

33 3 3 3 3 


gttggaggag 

3 *33**33**3 


atggtgggcc 

3 3 333 


tttggaagat 


6840 


gttgaagtgg 

3 3 3 33 


gcgtggggca 

3 3 *3333*~ 


ggccgaccga 


gtccctgatg 


aagtgggcgt 

3 333 3 


aggagtcctg 


6900 


cagcttggcg 


acgagctcgg 


cggtgacgag 

33 ^3 3 3 


gacgtccagg 


gcgcagtagt 


cgagggtctc 


6960 


ttggatgatg 


tcatacttga 


gctggccctt 


ctgcttccac 


agctcgcggt 


tgagaaggaa 


7020 


ctcttcgcgg 


tccttccagt 


actcttcgag 


ggggaacccg 

Z/ mJ Zf mj 


tcctgatcgg 


cacggtaaga 


7080 


gcccaccatg 


tagaactggt 


tgacggcctt 


gtaggcgcag 

3 33 3 3 


cagcccttct 


ccacggggag 


7140 


ggcgtaagct 


tgcgcggcct 

3 3 3 3 


tgcgcaggga 

3 3 3 3 3 


ggtgtgggtg 

33 3 333 3 


agggcgaagg 

3 3 3 3 33 


tgtcgcgcac 


7200 


catgaccttg 


aggaactggt 


gcttgaagtc 


gaggtcgtcg 


cagccgccct 


gctcccagag 


7260 


ttggaagtcc 


gtgcgcttct 


tgtaggcggg 

3 3 3 3 33 


gttaggcaaa 


gcgaaagtaa 


catcgttgaa 


7320 


gaggatcttg 


cccgcgcggg 

3 3 33 3 


gcatgaagtt 


gcgagtgatg 

ZJ ZJ ZJ ZJ 3 


cggaaaggct 


ggggcacctc 


7380 


ggcccggttg 


ttgatgacct 


gggcggcgag 


gacgatctcg 


tcgaagccgt 


tgatgttgtg 

3 3 — 3 3 


7440 


cccgacgatg 


tagagttcca 


cgaatcgcgg 


gcggcccttg 

3 ZJ ZJ 3 


acgtggggca 

3 3333 


gcttcttgag 


7500 


ctcgtcgtag 


gtgagctcgg 


cggggtcgct 

3333 3 


gagcccgtgc 


tgctcgaggg 

3 3 333 


cccagtcggc 


7560 


gacgtqgggg 

3 3 333^3 


ttggcgctga 

3 3 3 3 


ggaaggaagt 


ccagagatcc 


acggccaggg 


cggtctgcaa 


7620 


gcggtcccgg 

3 33 33 


tactgacgga 


actgttggcc 


cacggccatt 


ttttcgggqg 

™*~^"33333 


tgacgcagta 


7680 


gaaggtgcgg 

9 3 3 3 3 3 


gggtcgccgt 


gccagcggtc 

^/ ZJ ZJ 


ccacttgagc 


tggagggcga 


ggtcgtggqc 

33 **^"3 3 33 


7740 


gagctcgacg 


agcggcgggt 

3 3 3 3 3 3 


ccccggagag 


tttcatgacc 


agcatgaagg 


qqacqaqctq 

33 l * , '3 w 3 v 3 


7800 


l l tgccgaag 




agg xgxaggu 


l Lccacaxcg 


raggrgagga 


agagcctttc 


70CA 

/obU 


ggtgcgagga 


tgcgagccga 


tggggaagaa 


ctggatctcc 


tgccaccagt 


tggaggaatg 


7920 


gctgttgatg 


tgatggaagt 


agaaatgccg 


acggcgcgcc 


gagcactcgt 


gcttgtgttt 


7980 
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atacaagcgt 


ccgcagtgct 


cgcaacgctg 


cacgggatgc 


acgtgctgca 


cgagctgtac 


8040 


ctgggttcct 


ttggcgagga 


atttcagtgg 


gcagtggagc 


gctggcggct 


gcatctcgtg 


8100 


ctgtactacg 


tcttggccat 


cggcgtggcc 


atcgtctgcc 


tcgatggtgg 


tcatgctgac 


8160 


gagcccgcgc 


gggaggcagg 


tccagacctc 


ggctcggacg 


ggtcggagag 


cgaggacgag 


8220 


ggcgcgcagg 


ccggagctgt 


ccagggtcct 


gagacgctgc 


ggagtcaggt 


cagtgggcag 


8280 


cggcggcgcg 


cggttgactt 


gcaggagctt 


ttccagggcg 


cgcgggaggt 


ccagatggta 


8340 


cttgatctcc 


acggcgccgt 


tggtggctac 


gtccacggct 


tgcagggtgc 


cgtgcccctg 


8400 


gggcgccacc 


accgtgcccc 


gtttcttctt 


gggcgctgct 


tccatgtcgg 


tcagaagcgg 


8460 


cggcgaggac 


gcgcgccggg 


cggcaggggc 


ggctcggggc 


ccggaggcag 


gggcggcagg 


8520 


ggcacgtcgg 


cgccgcgcgc 


gggcaggttc 


tggtactgcg 


cccggagaag 


actggcgtga 


8580 


gcgacgacgc 


gacggttgac 


gtcctggatc 


tgacgcctct 


gggtgaaggc 


cacgggaccc 


8640 


gtgagtttga 


acctgaaaga 


gagttcgaca 


gaatcaatct 


cggtatcgtt 


gacggcggcc 


8700 


tgccgcagga 


tctcttgcac 


gtcgcccgag 


ttgtcctggt 


aggcgatctc 


ggtcatgaac 


8760 


tgctcgatct 


cctcctcctg 


aaggtctccg 


cggccggcgc 


gctcgacggt 


ggccgcgagg 


8820 


tcgttggaga 


tgcggcccat 


gagctgcgag 


aaggcgttca 


tgccggcctc 


gttccagacg 


8880 


cggctgtaga 


ccacggctcc 


gtcggggtcg 


cgcgcgcgca 


tgaccacctg 


ggcgaggttg 


8940 


agctcgacgt 


ggcgcgtgaa 


gaccgcgtag 


ttgcagaggc 


gctggtagag 


gtagttgagc 


9000 


gtggtggcga 


tgtgctcggt 


gacgaagaag 


tacatgatcc 


agcggcggag 


cggcatctcg 


9060 


ctgacgtcgc 


ccagggcttc 


caagcgttcc 


atggcctcgt 


agaagtccac 


ggcgaagttg 


9120 


aaaaactggg 


agttgcgcgc 


cgagacggtc 


aactcctcct 


ccagaagacg 


gatgagctcg 


9180 


gcgatggtgg 


cgcgcacctc 


gcgctcgaag 


gccccggggg 


gctcctcttc 


catctcctcc 


9240 


tcttcctcct 


ccactaacat 


ctcttctact 


tcctcctcag 


gaggcggtgg 


cgggggaggg 


9300 


gccctgcgtc 


gccggcggcg 


cacgggcaga 


cggtcgatga 


agcgctcgat 


ggtctccccg 


9360 


cgccggcgac 


gcatggtctc 


ggtgacggcg 


cgcccgtcct 


cgcggggccg 


cagcatgaag 


9420 


acgccgccgc 


gcatctccag 


gtggccgccg 


ggggggtctc 


cgttgggcag 


ggagagggcg 


9480 


ctgacgatgc 


atcttatcaa 


ttgacccgta 


gggactccgc 


gcaaggacct 


gagcgtctcg 


9540 


agatccacgg 


gatccgaaaa 


ccgctgaacg 


aaggcttcga 


gccagtcgca 


gtcgcaaggt 


9600 


aggctgagcc 


cggtttcttg 


ttcttcgggt 


atttggtcgg 


gaggcgggcg 


ggcgatgctg 


9660 


ctggtgatga 


agttgaagta 


ggcggtcctg 


agacggcgga 


tggtggcgag 


gagcaccagg 


9720 


tccttgggcc 


cggcttgctg 


gatgcgcaga 


cggtcggcca 


tgccccaggc 


gtggtcctga 


9780 


cacctggcga 


ggtccttgta 


gtagtcctgc 


atgagccgct 


ccacgggcac 


ctcctcctcg 


9840 


cccgcgcggc 


cgtgcatgcg 


cgtgagcccg 


aacccgcgct 


gcggctggac 


gagcgccagg 


9900 


tcggcgacga 


cgcgctcggt 


gaggatggcc 


tgctggatct 


gggtgagggt 


ggtctggaag 


9960 


tcgtcgaagt 


cgacgaagcg 


gtggtaggct 


ccggtgttga 


tggtgtagga 


gcagttggcc 


10020 
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atgacggacc 


agttgacggt 


ctggtggccg 

33 3 3 3 


ggtcgcacga 


gctcgtggta 


cttgaggcgc 


10080 


gagtaggcQC 

3 3 — ^3 3 3 


gcgtgtcgaa 


gatgtagtcg 

3 *~ w 3 3 ^ ^" 3 


ttgcaggcgc 

w 3^"~33 3 


gcacgaggta 

3 3 7 7 


ctggtatccg 


10140 


acgaggaagt 


gcggcggcgg 

3 33 33 33 


ctggcggtag 


agcggccatc 


gctcggtggc 

^ 7777 


gggggcgccg 


10200 


ggcgcgaggt 


cctcgagcat 


gaggcggtgg 

3 33 33 33 


tagccgtaga 


tgtacctgga 


catccaggtg 


10260 


atqccqqcqq 

3 33 33 


cggtggtgga 

^"33 33 33 


ggcgcgcggg 


aactcgcgga 


cgcggttcca 


gatgttgcgc 


10320 


agcggcagga 

3 33 33 


agtagttcat 


ggtggccgcg 

33 33 33 


gtctggcccg 

3 33 3 


tgaggcgcgc 


gcagtcgtgg 


10380 


atgctctaga 


catacgggca 


aaaacgaaag 


cggtcagcgg 

33 3 33 


ctcgactccg 


tggcctggag 

77 777 


10440 


gctaagcgaa 


cgggttgggc 

333 333 


tgcgcgtgta 

3 3 3 3 


ccccggttcg 


aatctcgaat 


caggctggag 

77 77 7 


10500 


ccgcagctaa 


cgtggtactg 


gcactcccgt 


ctcgacccaa 


gcctgctaac 


gaaacctcca 


10560 


gqatacqqaq 

33 ~^»^*3 3~3 


gcgggtcgtt 


ttttggcctt 


ggtcgctggt 


catgaaaaac 


tagtaagcgc 
3 33 


10620 


ggaaagcggc 

3 3 3 ~33 


cgcccgcgat 


ggctcgctgc 

3 3 ' 1 *^"3 w 3 


cgtagtctgg 


agaaagaatc 


gccagggttg 


10680 


cgttgcggtg 

3 **3 "3 3 3 


tgccccggtt 


cgagcctcag 


cgctcggcgc 


cggccggatt 

~3 3 "33 


ccgcggctaa 


10740 


cgtgggcgtg 


gctgccccgt 


cgtttccaag 


accccttagc 


cagccgactt 


ctccagttac 


10800 


ggagcgagcc 

9 7 ZJ ZJ 7 


cctctttttt 


tttcttgtgt 


ttttgccaga 


tgcatcccgt 


actgcggcag 


10860 


atgcgccccc 


accctccacc 


acaaccgccc 


ctaccgcagc 


agcagcaaca 


gccggcgctt 

3 333 


10920 


ctgcccccgc 


cccagcagca 


gccagccact 


accgcggcgg 


ccgccgtgag 


cggagccggc 

"33 3 33 


10980 


gttcagtatg 


acctggcctt 


ggaagagggc 

33 3 333 


gaggggctgg 

ZJ ZJ ZJ ZJ ZJ 33 


cgcggctggg 

3 33 333 


ggcgtcgtcg 


11040 


ccggagcggc 

33 3 33 


acccgcgcgt 


gcagatgaaa 


agggacgctc 


gcgaggccta 


cgtgcccaag 


11100 


cagaacctgt 


tcagagacag 


gagcggcgag 

33 3 3 


gagcccgagg 

ZJ ZJ ZJ ZJ ZJ 


agatgcgcgc 


ctcccgcttc 


11160 


cacgcggggc 


gggagctgcg 


gcgcggcctg 


gaccgaaagc 


gggtgctgag 

333 3 33 


ggacgaggat 


11220 


ttcgaggcgg 


acgagctgac 


ggggatcagc 


cccgcgcgcg 


cgcacgtggc 


cgcggccaac 


11280 


ctggtcacgg 


cgtacgagca 


gaccgtgaag 


gaggagagca 


acttccaaaa 


atccttcaac 


11340 


aaccacgtgc 


gcacgctgat 


cgcgcgcgag 

3 3 3 3 3 


gaggtqaccc 

ZJ ZJ ZJ 7 


tgggcctgat 


gcacctgtgg 


11400 


gacctgctgg 


aggccatcgt 


gcagaacccc 


acgagcaagc 


cgctgacggc 


gcagctgttt 


11460 


ctggtggtgc 

3 3 3 3 3 


agcacagtcg 


ggacaacgag 


acgttcaggg 


aggcgctgct 


gaatatcacc 


11520 


gagcccgagg 


gccgctggct 


cctggacctg 


gtgaacattt 


tgcagagcat 


cgtggtgcag 


11580 


gagcgcgggc 


tgccgctgtc 


cgagaagctg 


gcggccatca 


acttctcggt 


gctgagtctg 


11640 


ggcaagtact 


acgctaggaa 


gatctacaag 


accccgtacg 


tgcccataga 


caaggaqgtg 

ZJ ZJ ZJ ZJ ^3 


11700 


aagatcgacg 


ggttttacat 


gcgcatgacc 


ctgaaagtgc 


tgaccctgag 


cgacgatctg 


11760 


ggggtgtacc 

73 3 3 3 


gcaacgacag 


gatgcaccgc 


gcggtgagcg 

3 33 ^3**3*3 


ccagccgccg 


gcqcqaqctq 


11820 


agcgaccagg 

7 7 ZJ ZJ 


agctgatgca 


cagcctgcag 


cgggccctga 


ccggggccqq 

w ^"3 77 7 w *"37 


qaccqaqqqq 


11880 


gagagctact 


ttgacatggg 


cgcggacctg 


cgctggcagc 


ccagccgccg 


qqccttqqaa 


11940 


gctgccggcg 


gttcccccta 


cgtggaggag 


gtggacgatg 


aggaggagga 


gggcgagtac 


12000 


ctggaagact 


gatggcgcga 


ccgtattttt 


gctagatgca 


gcaacagcca 


ccgccgccgc 


12060 


ctcctgatcc 


cgcgatgcgg 


gcggcgctgc 


agagccagcc gtccggcatt 
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acgattggac 


ccaggccatg 


caacgcatca 


tggcgctgac 


gacccgcaat 


cccgaagcct 


12180 


ttagacagca 


gcctcaggcc 


aaccggctct 


cggccatcct 


ggaggccgtg 


gtgccctcgc 


12240 


gctcgaaccc 


cacgcacgag 


aaggtgctgg 


ccatcgtgaa 


cgcgctggtg 


gagaacaagg 


12300 


ccatccgcgg 


tgacgaggcc 


gggctggtgt 


acaacgcgct 


gctggagcgc 


gtggcccgct 


12360 


acaacagcac 


caacgtgcag 


acgaacctgg 


accgcatggt 


gaccgacgtg 


cgcgaggcgg 


12420 


tgtcgcagcg 


cgagcggttc 


caccgcgagt 


cgaacctggg 


ctccatggtg 


gcgctgaacg 


12480 


ccttcctgag 


cacgcagccc 


gccaacgtgc 


cccggggcca 


ggaggactac 


accaacttca 


12540 


tcagcgcgct 


gcggctgatg 


gtggccgagg 


tgccccagag 


cgaggtgtac 


cagtcggggc 


12600 


cggactactt 


cttccagacc 


agtcgccagg 


gcttgcagac 


cgtgaacctg 


agccaggctt 


12660 


tcaagaactt 


gcagggactg 


tggggcgtgc 


aggccccggt 


cggggaccgc 


gcgacggtgt 


12720 


cgagcctgct 


gacgccgaac 


tcgcgcctgc 


tgctgctgct 


ggtggcgccc 


ttcacggaca 


12780 


gcggcagcgt 


gagccgcgac 


tcgtacctgg 


gctacctgct 


taacctgtac 


cgcgaggcca 


12840 


tcggacaggc 


gcacgtggac 


gagcagacct 


accaggagat 


cacccacgtg 


agccgcgcgc 


12900 


tgggccagga 


ggacccgggc 


aacctggagg 


ccaccctgaa 


cttcctgctg 


accaaccggt 


12960 


cgcagaagat 


cccgccccag 


tacgcgctga 


gcaccgagga 


ggagcgcatc 


ctgcgctacg 


13020 


tgcagcagag 


cgtggggctg 


ttcctgatgc 


aggagggggc 


cacgcccagc 


gcggcgctcg 


13080 


acatgaccgc 


gcgcaacatg 


gagcccagca 


tgtacgcccg 


caaccgcccg 


ttcatcaata 


13140 


agctgatgga 


ctacttgcat 


cgggcggccg 


ccatgaactc 


ggactacttt 


accaacgcca 


13200 


tcttgaaccc 


gcactggctc 


ccgccgcccg 


ggttctacac 


gggcgagtac 


gacatgcccg 


13260 


accccaacga 


cgggttcctg 


tgggacgacg 


tggacagcag 


cgtgttctcg 


ccgcgtccag 


13320 


gaaccaatgc 


cgtgtggaag 


aaagagggcg 


gggaccggcg 


gccgtcctcg 


gcgctgtccg 


13380 


gtcgcgcggg 


tgctgccgcg 


gcggtgcccg 


aggccgccag 


ccccttcccg 


agcctgccct 


13440 


tttcgctgaa 


cagcgtgcgc 


agcagcgagc 


tgggtcggct 


gacgcgaccg 


cgcctgctgg 


13500 


gcgaggagga 


gtacctgaac 


gactccttgt 


tgaggcccga 


gcgcgagaag 


aacttcccca 


13560 


ataacgggat 


agagagcctg 


gtggacaaga 


tgagccgctg 


gaagacgtac 


gcgcacgagc 


13620 


acagggacga 


gccccgagct 


agcagcgcag 


gcacccgtag 


acgccagcgg 


cacgacaggc 


13680 


agcggggact 


ggtgtgggac 


gatgaggatt 


ccgccgacga 


cagcagcgtg 


ttggacttgg 


13740 


gtgggagtgg 


tggtaacccg 


ttcgctcacc 


tgcgcccccg 


tatcgggcgc 


ctgatgtaag 


13800 


aatctgaaaa 


aataaaagac 


ggtactcacc 


aaggccatgg 


cgaccagcgt 


gcgttcttct 


13860 


ctgttgtttg 


tagtagtatg 


atgaggcgcg 


tgtacccgga 


gggtcctcct 


ccctcgtacg 


13920 


agagcgtgat 


gcagcaggcg 


gtggcggcgg 


cgatgcagcc 


cccgctggag 


gcgccttacg 


13980 


tgcccccgcg 


gtacctggcg 


cctacggagg 


ggcggaacag 


cattcgttac 


tcggagctgg 


14040 


cacccttgta 


cgataccacc 


cggttgtacc 


tggtggacaa 


caagtcggca 


gacatcgcct 


14100 


cgctgaacta 


ccagaacgac 


cacagcaact 


tcctgaccac 


cgtggtgcag 


aacaacgatt 


14160 
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tcacccccac 


qqaqqccaqc 


acccagacca 


tcaactttga 


cgagcgctcg 


cggtggggcg 


14220 


qccaqctqaa 


aaccatcatg 


cacaccaaca 


tgcccaacgt 


gaacgagttc 


atgtacagca 


14280 


acaagttcaa 


ggcgcgggtg 

33 3 333 3 


atggtctcgc 


gcaagacccc 


caacggggtg 


gatgatgatt 


14340 


atqatqgtag 

3 3 3 27 


tcaggacgag 


ctgacctacg 


agtgggtgga 


gtttgagctg 


cccgagggca 


14400 


acttctcggt 


gaccatgacc 


atcgatctga 


tgaacaacgc 


catcatcgac 


aactacttgg 


14460 


cggtggggcg 


gcagaacggg 


gtgctggaga 


gcgacatcgg 


cgtgaagttc 


gacacgcgca 


14520 


acttccggct 


gggctgggac 


cccgtgaccg 


agctggtgat 


gccgggcgtg 


tacaccaacg 


14580 


aggccttcca 


ccccgacatc 


gtcctgctgc 


ccggctgcgg 


cgtggacttc 


accgagagcc 


14640 


gcctcagcaa 


cctgctgggc 

3 3 3^ 


atccgcaagc 


ggcagccctt 


ccaggagggc 


ttccagatcc 


14700 


tqtacgagqa 


cctggagggg 

33 3 3 3 3 


ggcaacatcc 

3 3 


ccgcgctctt 


ggatgtcgaa 


gcctacgaga 


14760 


aaagcaagga 


ggatagcacc 


gccgcggcga 


ccgcagccgt 


ggccaccgcc 


tctaccgagg 


14820 


tqcqqqqcqa 


taattttgct 


agcgctgcgg 

^^3 3 3 3^ 


cagcggccga 


ggcggctgaa 


accgaaagta 


14880 


agatagtcat 


ccagccggtg 


gagaaggaca 


gcaaggacag 


gagctacaac 


gtgctcgcgg 


14940 


acaagaaaaa 


caccgcctac 


cgcagctggt 


acctggccta 


caactacggc 


gaccccgaga 


15000 


agqqcgtqcq 


ctcctggacg 


ctgctcacca 


cctcggacgt 


cacctgcggc 


gtggagcaag 


15060 


tctactggtc 


gctgcccgac 


atgatgcaag 


acccggtcac 


cttccgctcc 


acgcgtcaag 


15120 


ttagcaacta 


cccggtggtg 


ggcgccgagc 


tcctgcccgt 


ctactccaag 


agcttcttca 


15180 


acgagcaggc 


cgtctactcg 


cagcagctgc 


gcgccttcac 


ctcgctcacg 


cacgtcttca 


15240 


accgcttccc 


cgagaaccag 


atcctcgtcc 


gcccgcccgc 


gcccaccatt 


accaccgtca 


15300 


gtgaaaacgt 


tcctgctctc 


acagatcacg 


ggaccctgcc 


gctgcgcagc 


agtatccggg 


15360 


gagtccagcg 


cgtgaccgtc 


actgacgcca 


gacgccgcac 


ctgcccctac 


gtctacaagg 


15420 


ccctgggcgt 


agtcgcgccg 


cgcgtcctct 


cgagccgcac 


cttctaaaaa 


atgtccattc 


15480 


tcatctcgcc 


cagtaataac 


accggttggg 


gcctgcgcgc 


gcccagcaag 


atgtacggag 


15540 


gcgctcgcca 


acgctccacg 


caacaccccg 


tgcgcgtgcg 


cgggcacttc 


cgcgctccct 


15600 


ggggcgccct 


caagggccgc 

333 — ' 


gtgcgctcgc 


gcaccaccgt 


cgacgacgtg 


atcgaccagg 


15660 


tqgtggccga 

33 w 33 ^"3 


cgcgcgcaac 


tacacgcccg 


ccgccgcgcc 


cgtctccacc 


gtggacgccg 


15720 


tcatcgacag 


cgtggtggcc 


gacgcgcgcc 


ggtacgcccg 


caccaagagc 


cggcggcggc 


15780 


gcatcgcccg 


gcggcaccgg 


agcacccccg 


ccatgcgcgc 


ggcgcgagcc 


ttgctgcgca 


15840 


gggccaggcg 

3 3 3 3 3 3 


cacgggacgc 


agggccatgc 


tcagggcggc 


cagacgcgcg 


gcctccggca 


15900 


gcagcagcgc 


cggcaggacc 


cgcagacgcg 


cggccacggc 


ggcggcggcg 


gccatcgcca 


15960 


gcatgtcccg 


cccgcggcgc 


ggcaacgtgt 


actgggtgcg 


cgacgccgcc 


accggtgtgc 


16020 


gcgtgcccgt 


gcgcacccgc 


ccccctcgca 


cttgaagatg 


ctgacttcgc 


gatgttgatg 


16080 


tgtcccagcg 


gcgaggagga 


tgtccaagcg 


caaatacaag 


gaagagatgc 


tccaggtcat 


16140 


cgcgcctgag 


atctacggcc 


ccgcggcggc 


ggtgaaggag 


gaaagaaagc 


cccgcaaact 


16200 


gaagcgggtc 


aaaaaggaca 


aaaaggagga 


ggaagatgac ggactggtgg 
Page 26 


agtttgtgcg 


16260 



WO 2005/001103 



PCT/US2004/016614 



it »w* 



cgagttcgcc ccccggcggc 


gcgtgcagtg 


gcgcgggcgg 


aaagtgaaac 


cggtgctgcg 


16320 


gcccggcacc acggtggtct 


tcacgcccgg 


cgagcgttcc 


ggctccgcct 


ccaagcgctc 


16380 


ctacgacgag gtgtacgggg 


acgaggacat 


cctcgagcag 


gcggtcgagc 


gtctgggcga 


16440 


gtttgcgtac ggcaagcgca 


gccgccccgc 


gcccttgaaa 


gaggaggcgg 


tgtccatccc 


16500 


gctggaccac ggcaacccca 


cgccgagcct 


gaagccggtg 


accctgcagc 


aggtgctacc 


16560 


gagcgcggcg ccgcgccggg 


gcttcaagcg 


cgagggcggc 


gaggatctgt 


acccgaccat 


16620 


gcagctgatg gtgcccaagc 


gccagaagct 


ggaggacgtg 


ctggagcaca 


tgaaggtgga 


16680 


ccccgaggtg cagcccgagg 


tcaaggtgcg 


gcccatcaag 


caggtggccc 


cgggcctggg 


16740 


cgtgcagacc gtggacatca 


agatccccac 


ggagcccatg 


gaaacgcaga 


ccgagcccgt 


16800 


gaagcccagc accagcacca 


tggaggtgca 


gacggatccc 


tggatgccag 


caccagcttc 


16860 


caccagcact cgccgaagac 


gcaagtacgg 


cgcggccagc 


ctgctgatgc 


ccaactacgc 


16920 


gctgcatcct tccatcatcc 


ccacgccggg 


ctaccgcggc 


acgcgcttct 


accgcggcta 


16980 


caccagcagc cgccgccgca 


agaccaccac 


ccgccgccgt 


cgtcgcagcc 


gccgcagcag 


17040 


caccgcgact tccgccttgg 


tgcggagagt 


gtatcgcagc 


gggcgcgagc 


ctctgaccct 


17100 


gccgcgcgcg cgctaccacc 


cgagcatcgc 


catttaacta 


ccgcctccta 


cttgcagata 


17160 


tggccctcac atgccgcctc 


cgcgtcccca 


ttacgggcta 


ccgaggaaga 


aagccgcgcc 


17220 


gtagaaggct gacggggaac 


gggctgcgtc 


gccatcacca 


ccggcggcgg 


cgcgccatca 


17280 


gcaagcggtt ggggggaggc 


ttcctgcccg 


cgctgatccc 


catcatcgcc 


gcggcgatcg 


17340 


gggcgatccc cggcatagct 


tccgtggcgg 


tgcaggcctc 


tcagcgccac 


tgagacacaa 


17400 


aaaagcatgg atttgtaata 


aaaaaaaaaa 


tggactgacg 


ctcctggtcc 


tgtgatgtgt 


17460 


gtttttagat ggaagacatc 


aatttttcgt 


ccctggcacc 


gcgacacggc 


acgcggccgt 


17520 


ttatgggcac ctggagcgac 


atcggcaaca 


gccaactgaa 


cgggggcgcc 


ttcaattgga 


17580 


gcagtctctg gagcgggctt 


aagaatttcg 


ggtccacgct 


caaaacctat 


ggcaacaagg 


17640 


cgtggaacag cagcacaggg 


caggcgctga 


gggaaaagct 


gaaagaacag 


aacttccagc 


17700 


agaaggtggt tgatggcctg 


gcctcaggca 


tcaacggggt 


ggttgacctg 


gccaaccagg 


17760 


ccgtgcagaa acagatcaac 


agccgcctgg 


acgcggtccc 


gcccgcgggg 


tccgtggaga 


17820 


tgccccaggt ggaggaggag 


ctgcctcccc 


tggacaagcg 


cggcgacaag 


cgaccgcgtc 


17880 


ccgacgcgga ggagacgctg 


ctgacgcaca 


cggacgagcc 


gcccccgtac 


gaggaggcgg 


17940 


tgaaactggg cctgcccacc 


acgcggcccg 


tggcgcctct 


ggccaccgga 


gtgctgaaac 


18000 


ccagcagcag ccagcccgcg 


accctggact 


tgcctccgcc 


tcgcccctcc 


acagtggcta 


18060 


agcccctgcc gccggtggcc 


gtcgcgtcgc 


gcgccccccg 


aggccgcccc 


caggcgaact 


18120 


ggcagagcac tctgaacagc 


atcgtgggtc 


tgggagtgca 


gagtgtgaag 


cgccgccgct 


18180 


gctattaaaa gacactgtag 


cgcttaactt 


gcttgtctgt 


gtgtatatgt 


atgtccgccg 


18240 


accagaagga ggagtgtgaa 


gaggcgcgtc 


gccgagttgc 


aagatggcca 


ccccatcgat 


18300 
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gctgccccag tgggcgtaca tgcacatcgc cggacaggac gcttcggagt acctgagtcc 18360 

gggtctggtg cagttcgccc gcgccacaga cacctacttc agtctgggga acaagtttag 18420 

gaaccccacg gtggcgccca cgcacgatgt gaccaccgac cgcagccagc ggctgacgct 18480 

gcgcttcgtg cccgtggacc gcgaggacaa cacctactcg tacaaagtgc gctacacgct 18540 

ggccgtgggc gacaaccgcg tgctggacat ggccagcacc tactttgaca tccgcggcgt 18600 

gctggaccgg ggccctagct tcaaacccta ctctggcacc gcctacaaca gcctagctcc 18660 

caagggagct cccaattcca gccagtggga gcaagcaaaa acaggcaatg ggggaactat 18720 

ggaaacacac acatatggtg tggccccaat gggcggagag aatattacaa aagatggtct 18780 

tcaaattgga actgacgtta cagcgaatca gaataaacca atttatgccg acaaaacatt 18840 

tcaaccagaa ccgcaagtag gagaagaaaa ttggcaagaa actgaaaact tttatggcgg 18900 

tagagctctt aaaaaagaca caaacatgaa accttgctat ggctcctatg ctagacccac 18960 

caatgaaaaa ggaggtcaag ctaaacttaa agttggagat gatggagttc caaccaaaga 19020 

attcgacata gacctggctt tctttgatac tcccggtggc accgtgaacg gtcaagacga 19080 

gtataaagca gacattgtca tgtataccga aaacacgtat ttggaaactc cagacacgca 19140 

tgtggtatac aaaccaggca aggatgatgc aagttctgaa attaacctgg ttcagcagtc 19200 

tatgcccaac agacccaact acattgggtt cagggacaac tttatcggtc ttatgtacta 19260 

caacagcact ggcaatatgg gtgtgcttgc tggtcaggcc tcccagctga atgctgtggt 19320 

tgatttgcaa gacagaaaca ccgagctgtc ctaccagctc ttgcttgact ctttgggtga 19380 

cagaacccgg tatttcagta tgtggaacca ggcggtggac agttatgacc ccgatgtgcg 19440 

catcatcgaa aaccatggtg tggaggatga attgccaaac tattgcttcc ccttggacgg 19500 

ctctggcact aacgccgcat accaaggtgt gaaagtaaaa gatggtcaag atggtgatgt 19560 

tgagagtgaa tgggaaaatg acgatactgt tgcagctcga aatcaattat gtaaaggtaa 19620 

cattttcgcc atggagatta atctccaggc taacctgtgg agaagtttcc tctactcgaa 19680 

cgtggccctg tacctgcccg actcctacaa gtacacgccg accaacgtca cgctgccgac 19740 

caacaccaac acctacgatt acatgaatgg cagagtgaca cctccctcgc tggtagacgc 19800 

ctacctcaac atcggggcgc gctggtcgct ggaccccatg gacaacgtca accccttcaa 19860 

ccaccaccgc aacgcgggcc tgcgctaccg ctccatgctc ctgggcaacg ggcgctacgt 19920 

gcccttccac atccaggtgc cccaaaagtt tttcgccatc aagagcctcc tgctcctgcc 19980 

cgggtcctac acctacgagt ggaacttccg caaggacgtc aacatgatcc tgcagagctc 20040 

cctaggcaac gacctgcgca cggacggggc ctccatcgcc ttcaccagca tcaacctcta 20100 

cgccaccttc ttccccatgg cgcacaacac cgcctccacg ctcgaggcca tgctgcgcaa 20160 

cgacaccaac gaccagtcct tcaacgacta cctctcggcg gccaacatgc tctaccccat 20220 

cccggccaac gccaccaacg tgcccatctc catcccctcg cgcaactggg ccgccttccg 20280 

cggatggtcc ttcacgcgcc tgaagacccg cgagacgccc tcgctcggct ccgggttcga 20340 

cccctacttc gtctactcgg gctccatccc ctacctagac ggcaccttct acctcaacca 20400 
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caccttcaag 


aaggtctcca 


tcaccttcga 


ctcctccgtc 


agctggcccg 


gcaacgaccg 


20460 


cctcctgacg 


cccaacgagt 


tcgaaatcaa 


gcgcaccgtc 


gacggagagg 


gatacaacgt 


20520 


ggcccagtgc 


aacatgacca 


aggactggtt 


cctggtccag 


atgctggccc 


actacaacat 


20580 


cggctaccag 


ggcttctacg 


tgcccgaggg 


ctacaaggac 


cgcatgtact 


ccttcttccg 


20640 


caacttccag 


cccatgagcc 


gccaggtcgt 


ggacgaggtc 


aactacaagg 


actaccaggc 


20700 


cgtcaccctg 


gcctaccagc 


acaacaactc 


gggcttcgtc 


ggctacctcg 


cgcccaccat 


20760 


gcgccagggc 


cagccctacc 


ccgccaacta 


cccctacccg 


ctcatcggca 


agagcgccgt 


20820 


cgccagcgtc 


acccagaaaa 


agttcctctg 


cgaccgggtc 


atgtggcgca 


tccccttctc 


20880 


cagcaacttc 


atgtccatgg 


gcgcgctcac 


cgacctcggc 


cagaacatgc 


tctacgccaa 


20940 


ctccgcccac 


gcgctagaca 


tgaatttcga 


agtcgacccc 


atggatgagt 


ccacccttct 


21000 


ctatgttgtc 


ttcgaagtct 


tcgacgtcgt 


ccgagtgcac 


cagccccacc 


gcggcgtcat 


21060 


cgaagccgtc 


tacctgcgca 


cgcccttctc 


ggccggcaac 


gccaccacct 


aagccgctct 


21120 


tgcttcttgc 


aagatgacgg 


cgggctccgg 


cgagcaggag 


ctcagggcca 


tcctccgcga 


21180 


cctgggctgc 


gggccctgct 


tcctgggcac 


cttcgacaag 


cgcttccctg 


gattcatggc 


21240 


cccgcacaag 


ctggcctgcg 


ccatcgtgaa 


cacggccggc 


cgcgagaccg 


ggggcgagca 


21300 


ctggctggcc 


ttcgcctgga 


acccgcgctc 


ccacacatgc 


tacctcttcg 


accccttcgg 


21360 


gttctcggac 


gagcgcctca 


agcagatcta 


ccagttcgag 


tacgagggcc 


tgctgcgtcg 


21420 


cagcgccctg 


gccaccgagg 


accgctgcgt 


caccctggaa 


aagtccaccc 


agaccgtgca 


21480 


gggtccgcgc 


tcggccgcct 


gcgggctctt 


ctgctgcatg 


ttcctgcacg 


ccttcgtgca 


21540 


ctggcccgac 


cgccccatgg 


acaagaaccc 


caccatgaac 


ttactgacgg 


gggtgcccaa 


21600 


cggcatgctc 


cagtcgcccc 


aggtggaacc 


caccctgcgc 


cgcaaccagg 


aagcgctcta 


21660 


ccgcttcctc 


aatgcccact 


ccgcctactt 


tcgctcccac 


cgcgcgcgca 


tcgagaaggc 


21720 


caccgccttc 


gaccgcatga 


atcaagacat 


gtaaaaaacc 


ggtgtgtgta 


tgtgaatgct 


21780 


ttattcataa 


taaacagcac 


atgtttatgc 


caccttctct 


gaggctctga 


ctttatttag 


21840 


aaatcgaagg 


ggttctgccg 


gctctcggca 


tggcccgcgg 


gcagggatac 


gttgcggaac 


21900 


tggtacttgg 


gcagccactt 


gaactcgggg 


atcagcagct 


tgggcacggg 


gaggtcgggg 


21960 


aacgagtcgc 


tccacagctt 


gcgcgtgagt 


tgcagggcgc 


ccagcaggtc 


gggcgcggag 


22020 


atcttgaaat 


cgcagttggg 


acccgcgttc 


tgcgcgcgag 


agttgcggta 


cacggggttg 


22080 


cagcactgga 


acaccatcag 


ggccgggtgc 


ttcacgcttg 


ccagcaccgt 


cgcgtcggtg 


22140 


atgccctcca 


cgtccagatc 


ctcggcgttg 


gccatcccga 


agggggtcat 


cttgcaggtc 


22200 


tgccgcccca 


tgctgggcac 


gcagccgggc 


ttgtggttgc 


aatcgcagtg 


cagggggatc 


22260 


agcatcatct 


gggcctgctc 


ggagctcatg 


cccgggtaca 


tggccttcat 


gaaagcctcc 


22320 


agctggcgga 


aggcctgctg 


cgccttgccg 


ccctcggtga 


agaagacccc 


gcaggacttg 


22380 


ctagagaact 


ggttggtggc 


gcagccggcg 


tcgtgcacgc 


agcagcgcgc 


gtcgttgttg 


22440 
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qccaqctqca 


ccacgctgcg 


cccccagcgg 


ttctgggtga 


tcttggcccg 


gttggggttc 


22500 


tccttcagcg 


cgcgctgccc 


gttctcgctc 


gccacatcca 


tctcgatagt 


gtgctccttc 


22560 


tggatcatca 


cggtcccgtg 


caggcaccgc 


agcttgccct 


eggctteggt 


gcagccgtgc 


22620 


agccacagcg 


cgcagccggt 

ZJ 9 ^/ ZJ 


gcactcccag 


ttcttgtggg 


cgatctggga 


gtgcgagtgc 


22680 


acgaagccct 


geaggaageg 

9 3 9 ZJ ZJ 


gcccatcatc 


gcggtcaggg 

9 ZJ <J *J ZJ -J 


tcttgttgct 


ggtgaaggtc 


22740 


agcgggatgc 

ZJ Z? ZJ ZJ ~9 


cgcggtgctc 

9 9 9 ZJ 


ctcgttcaca 


tacaggtggc 


agatgeggeg 


gtacacctcg 


22800 


ccctgctcgg 


gcatcagctg 


gaaggeggae 

9 9 9 9 9 


ttcaggtege 


tctccacgcg 


gtaccggtcc 


22860 


atcagcagcg 


tcatcacttc 


catgcccttc 


tcccaggccg 


aaacgategg 


caggctcagg 


22920 


gggttcttca 


ccgccattgt 


catcttagtc 


gccgccgccg 

9 9 9 9 


aggtcagggg 


gtcgttctcg 


22980 


tccagggtct 


caaacactcg 


cttgccgtcc 

9 9 


ttctcgatga 


tgcgcacggg 


gggaaagctg 


23040 


aagcccacgg 


ccgccagctc 


ctcctcggcc 


tgcctttcgt 


cctcgctgtc 


ctggctgatg 


23100 


tcttgcaaag 


geacatgett 


ggtcttgegg 


ggtttctttt 


tgggcggcag 


aggeggegge 


23160 


gatgtgctgg 


gagagegega 


gttctcgttc 


accacgacta 


tttcttcttc 


ttggccgtcg 


23220 


tccgagacca 


cgcggcggta 

ZJ ZJ 9 ZJ ZJ 


ggcatgcctc 


ttctggggca 


gaggeggagg 


cgacgggctc 


23280 


tcgcggttcg 

j ZJ ZJ 9 


gegggegget 

9 9 9 9 9 9 


ggcagagccc 


cttccgcgtt 


cgggggtgcg 


ctcctggcgg 


23340 


cgctgctctg 


actgacttcc 


tccgcggccg 


gccattgtgt 


tctcctaggg 


agcaacaaca 


23400 


agcatggaga 


ctcagccatc 


gtcgccaaca 


tcgccatctg 


cccccgccgc 


caccgccgac 


23460 


gagaaccagc 


agcagaatga 


aagcttaacc 


gccccgccgc 


ccagccccac 


ctccgacgcc 


23520 


gcggccccag 


acatgeaaga 


gatggaggaa 


tccatcgaga 


ttgacctggg 


ctacgtgacg 


23580 


cccgcggagc 


acgaggagga 

9 9 9 9 9 


gctggcagcg 


cgcttttcag 


ccccggaaga 


gaaccaccaa 


23640 


gagcagccag 


agcaggaagc 


agagaacgag 


cagaaccagg 


ctgggcacga 


geatggegae 


23700 


tacctgagcg 


gggcagagga 

9 9 9 9 99 


cgtgctcatc 


aagcatctgg 


cccgccaatg 


catcatcgtc 


23760 


aaggacgcgc 


tgctcgaccg 


cgccgaggtg 


cccctcagcg 


tggeggaget 


cagccgcgcc 


23820 


tacgagcgca 


acctcttctc 


gccgcgcgtg 


ccccccaagc 


gccagcccaa 


cggcacctgt 


23880 


gagcccaacc 


cgcgcctcaa 


cttctacccg 


gtcttcgegg 


tgcccgaggc 


cctggccacc 


23940 


taccacctct 


ttttcaagaa 


ccaaaggatc 


cccgtctcct 


gccgcgccaa 


ccgcacccgc 


24000 


gccgacgccc 


tgctcaacct 


gggccccggc 


gcccgcctac 


ctgatatcac 


ctccttggaa 


24060 


gaggttccca 


agatcttcga 

9 9 


gggtctgggc 

9 9 9 9 9 9 


agegacgaga 

9 9 ~J —* 


ctcgggccgc 


gaacgctctg 


24120 


caaggaagcg 


gagaggagca 


tgagcaccac 


agcgccctgg 


tggagttgga 


aggegacaac 


24180 


gcgcgcctgg 


cggtcctcaa 


gcgcacggtc 


gagctgaccc 


acttcgccta 


cccggcgctc 


24240 


aacctgcccc 


ccaaggtcat 


gagcgccgtc 


atggaccagg 

99 9 9 


tgctcatcaa 


gcgcgcctcg 

9 9^9^"^" 9 


24300 


cccctctcgg 


aggaggagat 


gcaggacccc 


gagagttegg 

9 9 9 99 


acgagggcaa 


gcccgtqgtc 

9 9 99 


24360 


a/if na "a /~ 

ag eg a eg age 


age tggcgcg 


c eggceggga 


gegag eagea 


eeeeeeagag 


+• <"i ^ <^ *^ /*i "5 /i 

cctggaagag 




cggcgcaagc 


tcatgatggc 


cgtggtcctg 


gtgaccgtgg 


agctggagtg 


tctgcgccgc 


24480 


ttctttgccg 


aegeggagae 


cctgcgcaag 


gtcgaggaga 


acctgcacta 


cctcttcagg 


24540 
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cacgggttcg 


tgcgccaggc 


ctgcaagatc 


tccaacgtgg 


agctgaccaa 


cctggtctcc 


24600 


tacatgggca 


tcctgcacga 


gaaccgcctg 


gggcaaaacg 


tgctgcacac 


caccctgcgc 


24660 


ggggaggccc 


gccgcgacta 


catccgcgac 


tgcgtctacc 


tgtacctctg 


ccacacctgg 


24720 


cagacgggca 


tgggcgtgtg 


gcagcagtgc 


ctggaggagc 


agaacctgaa 


agagctctgc 


24780 


aagctcctgc 


agaagaacct 


caaggccctg 


tggaccgggt 


tcgacgagcg 


taccaccgcc 


24840 


tcggacctgg 


ccgacctcat 


cttccccgag 


cgcctgcggc 


tgacgctgcg 


caacgggctg 


24900 


cccgacttta 


tgagccaaag 


catgttgcaa 


aactttcgct 


ctttcatcct 


cgaacgctcc 


24960 


gggatcctgc 


ccgccacctg 


ctccgcgctg 


ccctcggact 


tcgtgccgct 


gaccttccgc 


25020 


gagtgccccc 


cgccgctctg 


gagccactgc 


tacttgctgc 


gcctggccaa 


ctacctggcc 


25080 


taccactcgg 


acgtgatcga 


ggacgtcagc 


ggcgagggtc 


tgctggagtg 


ccactgccgc 


25140 


tgcaacctct 


gcacgccgca 


ccgctccctg 


gcctgcaacc 


cccagctgct 


gagcgagacc 


25200 


cagatcatcg 


gcaccttcga 


gttgcaaggc 


cccggcgacg 


gcgagggcaa 


ggggggtctg 


25260 


aaactcaccc 


cggggctgtg 


gacctcggcc 


tacttgcgca 


agttcgtgcc 


cgaggactac 


25320 


catcccttcg 


agatcaggtt 


ctacgaggac 


caatcccagc 


cgcccaaggc 


cgagctgtcg 


25380 


gcctgcgtca 


tcacccaggg 


ggccatcctg 


gcccaattgc 


aagccatcca 


gaaatcccgc 


25440 


caagaatttc 


tgctgaaaaa 


gggccacggg 


gtctacttgg 


acccccagac 


cggagaggag 


25500 


ctcaacccca 


gcttccccca 


ggatgccccg 


aggaagcagc 


aagaagctga 


aagtggagct 


25560 


gccgccgccg 


gaggatttgg 


aggaagactg 


ggagagcagt 


caggcagagg 


aggaggagat 


25620 


ggaagactgg 


gacagcactc 


aggcagagga 


ggacagcctg 


caagacagtc 


tggaggagga 


25680 


agacgaggtg 


gaggaggcag 


aggaagaagc 


agccgccgcc 


agaccgtcgt 


cctcggcgga 


25740 


gaaagcaagc 


agcacggata 


ccatctccgc 


tccgggtcgg 


ggtcgcggcg 


gccgggccca 


25800 


cagtaggtgg 


gacgagaccg 


ggcgcttccc 


gaaccccacc 


acccagaccg 


gtaagaagga 


25860 


gcggcaggga 


tacaagtcct 


ggcgggggca 


caaaaacgcc 


atcgtctcct 


gcttgcaagc 


25920 


ctgcgggggc 


aacatctcct 


tcacccggcg 


ctacctgctc 


ttccaccgcg 


gggtgaactt 


25980 


cccccgcaac 


atcttgcatt 


actaccgtca 


cctccacagc 


ccctactact 


gtttccaaga 


26040 


agaggcagaa 


acccagcagc 


agcagaaaac 


cagcggcagc 


agcagctaga 


aaatccacag 


26100 


cggcggcagg 


tggactgagg 


atcgcggcga 


acgagccggc 


gcagacccgg 


gagctgagga 


26160 


accggatctt 


tcccaccctc 


tatgccatct 


tccagcagag 


tcgggggcag 


gagcaggaac 


26220 


tgaaagtcaa 


gaaccgttct 


ctgcgctcgc 


tcacccgcag 


ttgtctgtat 


cacaagagcg 


26280 


aagaccaact 


tcagcgcact 


ctcgaggacg 


ccgaggctct 


cttcaacaag 


tactgcgcgc 


26340 


tcactcttaa 


agagtagccc 


gcgcccgccc 


acacacggaa 


aaaggcggga 


attacgtcac 


26400 


cacctgcgcc 


cttcgcccga 


ccatcatgag 


caaagagatt 


cccacgcctt 


acatgtggag 


26460 


ctaccagccc 


cagatgggcc 


tggccgccgg 


cgccgcccag 


gactactcca 


cccgcatgaa 


26520 


ctggctcagt 


gccgggcccg 


cgatgatctc 


acgggtgaat 


gacatccgcg 


cccaccgaaa 


26580 
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ccagatactc 


ctagaacagt 


cagcgatcac 


cgccacgccc 


cgccatcacc 


ttaatccgcg 


26640 


taattqqccc 


qccqccctqq 


tgtaccagga 


aattccccag 


cccacgaccg 


tactacttcc 


26700 


acaaaacacc 


caggecgaag 


tccagctgac 


taactcaggt 


gtccagctgg 


ccggcggcgc 


26760 


caccctatat 


cgtcaccgcc 


ccqctcaqgg 


tataaagegg 


ctggtgatcc 


gaggcagagg 


26820 


cacacaqctc 


aacqacqaqq 


tqqtgagctc 

33 3 3 


ttcgctgggt 

3 333 


ctgcgacctg 


aeggagtett 


26880 


ccaactcacc 


qqatcqqqqa 


gatcttcctt 


cacgcctcgt 


caggccgtcc 


tgactttgga 


26940 


gagttcgtcc 


tcgcagcccc 


qctcqqgcgg 


catcggcact 


ctccagttcg 


tggaggagtt 


27000 


cactccctcg 


gtctacttca 


accccttctc 


cggctccccc 


ggccactacc 


eggacgagtt 


27060 


catcccgaac 


ttcgacgcca 


teagegagtc 


ggtggacggc 

33 ^^ ^s 


tacgattgaa 


tgtcccatgg 


27120 


taacacaact 


gacctagctc 


ggcttcgaca 


cctggaccac 


tgccgccgct 


tccgctgctt 


27180 


ca etc a a a at 


ctcqccqaqt 


ttgectaett 


tgagctgccc 


gaggagcacc 


ctcagggccc 


27240 


aacccacaaa 


qtqcqqatca 


tegtcgaagg 


gggcctcgac 


tcccacctgc 


tteggatett 


27300 


cagccagcga 


ccgatcctgg 


tegagegega 


acaaggacag 


acccttctta 


ctttgtactg 


27360 


catctgcaac 


caccccggcc 


tqcatgaaag 

3 3 ~~~~~3 


tctttgttgt 


ctgctgtgta 


ctgagtataa 


27420 


taaaagctga 


qatcageqac 


tactccggac 


tcgattgtgg 

3 3 33 


tgttcctgct 


ateaaceggt 


27480 


ccctgttctt 


caccqqqaac 


qaqaccqaqc 


tccagctcca 


gtgtaagccc 


cacaagaagt 


27540 


acctcaccta 


gctgttccag 


qqctccccqa 

3 3 3 


tcqccqttqt 


caaccactgc 


gacaacgacg 


27600 


gagtcctget 


qaqcqgccct 

77 77 77 l7 


gccaacctta 


ctttttccac 


ccgcagaagc 


aagctccagc 


27660 


tcttccaacc 


cttcctcccc 


gggacctatc 


agtgcgtctc 


aggaccctgc 


catcacacct 


27720 


tccacctgat 


cccgaatacc 


acagcgccgc 


tccccgctac 


taacaaccaa 


actacccacc 


27780 


aacgccaccg 


tcgcgacctt 


tcctctgaat 


ctaataccac 


taceggaggt 


gagctccgag 


27840 


gtcgaccaac 


ctctqqqatt 


tactacggcc 


cctggqaggt 


ggtggggtta 


atagegctag 


27900 


gcctagttgc 


qqqtqqqctt 


ttggttctct 


gctacctata 


cctcccttgc 


tgttcgtact 


27960 


taqtqqtqct 


qtqttqctqq 


tttaagaaat 


ggggaagatc 


accctagtga 


gctgcggtgc 

3 9 99 9 


28020 


gctgqtggcg 

3 33 w 33 v 3 


gtgttgcttt 


cgattgtggg 

9 — 3 ""333 


actgggegge 


gcggctgtag 

9 9 9 9 9 


tgaaggagaa 


28080 


ggccgatccc 


tgettgeatt 


tcaatcccaa 


caaatgecag 


ctgagttttc 


ageccgatgg 

9 9 99 


28140 


caatcqqtqc 

3 3 3 


qcqqtactqa 


teaagtgegg 


atgggaatgc 


gagaacgtga 


gaatcgagta 


28200 


caataacaag 


acteggaaca 


atactctcgc 


gtccgtgtgg 

3 9 3 33 


cagcccgggg 

9 9 3 9 9 


accccgagtg 


28260 


gtacaccgtc 


tctgtccccg 


qtgetqaegg 


ctccccgcgc 


acegtgaata 


atactttcat 


28320 


ttttgegcac 


atgtgcaaca 


cqgtcatgtq 

33 3 3 


qatgagcaag 

3 ^3^3 3 


cagtacgata 


tgtggccccc 


28380 


cacqaaqqaq 


aacatcgtgg 


tcttctccat 


cgcttacagc 


ctgtgcacgg 


cgctaatcac 


28440 


cgctatcqtq 


tgectgagea 


ttcacatget 


categctatt 


cgccccagaa 


ataatgeega 


28500 


gaaagagaaa 


cagccataac 


aegtttttte 


acacaccttg 


tttttacaga 


caatgegtet 


28560 


gttaaatttt 


ttaaacattg 


tgctcagtat 


tgettatgee 


tctggttatg 


caaacataca 


28620 


gaaaaccctt 


tatgtaggat 


ctgatggtac 


actagagggt acccaatcac 
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tgcatggtat 


ttttatagaa 


ccaacactga 


tccagttaaa 


ctttgtaagg 


gtgaattgcc 


28740 


gcgtacacat 


aaaactccac 


ttacatttag 


ttgcagcaat 


aataatctta 


cacttttttc 


28800 


aattacaaaa 


caatatactg 


gtacttatta 


cagtacaaac 


tttcatacag 


gacaagataa 


28860 


atattatact 


gttaaggtag 


aaaatcctac 


cactcctaga 


actaccacca 


ccaccactac 


28920 


tgcaaagccc 


actgtgaaaa 


ctacaactag 


gaccaccaca 


actacagaaa 


ccaccaccag 


28980 


cacaacactt 


gctgcaacta 


cacacacaca 


cactaagcta 


accttacaga 


ccactaatga 


29040 


tttgatcgcc 


ctgctgcaaa 


agggggataa 


cagcaccact 


tccaatgagg 


agatacccaa 


29100 


atccatgatt 


ggcattattg 


ttgctgtagt 


ggtgtgcatg 


ttgatcatcg 


ccttgtgcat 


29160 


ggtgtactat 


gccttctgct 


acagaaagca 


cagactgaac 


gacaagctgg 


aacacttact 


29220 


aagtgttgaa 


ttttaatttt 


ttagaaccat 


gaagatccta 


ggccttttta 


gtttttctat 


29280 


cattacctct 


gctctttgtg 


aatcagtgga 


tagagatgtt 


actattacca 


ctggttctaa 


29340 


ttatacactg 


aaagggccac 


cctcaggtat 


gctttcgtgg 


tattgctatt 


ttggaactga 


29400 


cactgatcaa 


actgaattat 


gcaattttca 


aaaaggcaaa 


acctcaaact 


ctaaaatctc 


29460 


taattatcaa 


tgcaatggca 


ctgatctgat 


actactcaat 


gtcacgaaag 


catatggtgg 


29520 


cagttattat 


tgccctggac 


aaaacactga 


agaaatgatt 


ttttacaaag 


tggaagtggt 


29580 


tgatcccact 


acaccaccca 


ccaccacaac 


tattcatacc 


acacacacag 


aacaaacacc 


29640 


agaggcaaca 


gaagcagagt 


tggccttcca 


ggttcacgga 


gattcctttg 


ctgtcaatac 


29700 


ccctacaccc 


gatcagcggt 


gtccggggcc 


gctagtcagc 


ggcattgtcg 


gtgtgctttc 


29760 


gggattagca 


gtcataatca 


tctgcatgtt 


catttttgct 


tgctgctata 


gaaggcttta 


29820 


ccgacaaaaa 


tcagacccac 


tgctgaacct 


ctatgtttaa 


ttttttccag 


agccatgaag 


29880 


gcagttagcg 


ctctagtttt 


ttgttctttg 


attggcattg 


tttttaatag 


taaaattacc 


29940 


agagttagct 


ttattaaaca 


tgttaatgta 


actgaaggag 


ataacatcac 


actagcaggt 


30000 


gtagaaggtg 


ctcaaaacac 


cacctggaca 


aaataccatc 


taggatggag 


agatatttgc 


30060 


acctggaatg 


taacttatta 


ttgcatagga 


gttaatctta 


ccattgttaa 


cgctaaccaa 


30120 


tctcagaatg 


ggttaattaa 


aggacagagt 


gttagtgtga 


ccagtgatgg 


gtactatacc 


30180 


cagcatagtt 


ttaactacaa 


cattactgtc 


ataccactgc 


ctacgcctag 


cccacctagc 


30240 


actaccacac 


agacaaccac 


atacagtaca 


tcaaatcagc 


ctaccaccac 


tacagcagca 


30300 


gaggttgcca 


gctcgtctgg 


ggtccgagtg 


gcatttttga 


tgttggcccc 


atctagcagt 


30360 


cccactgcta 


gtaccaatga 


gcagactact 


gaatttttgt 


ccactgtcga 


gagccacacc 


30420 


acagctacct 


ccagtgcctt 


ctctagcacc 


gccaatctct 


cctcgctttc 


ctctacacca 


30480 


atcagccccg 


ctactactcc 


tagccccgct 


cctcttccca 


ctcccctgaa 


gcaaacagac 


30540 


ggcggcatgc 


aatggcagat 


caccctgctc 


attgtgatcg 


ggttggtcat 


cctggccgtg 


30600 


ttgctctact 


acatcttctg 


ccgccgcatt 


cccaacgcgc 


accgcaagcc 


ggcctacaag 


30660 


cccatcgtta 


tcgggcagcc 


ggagccgctt 


caggtggaag 


ggggtctaag 


gaatcttctc 


30720 
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ttctctttta 


cagtatggtg 


attgaactat 


gattcctaga 


caattcttga 


tcactattct 


30780 


tatctgcctc 


ctccaagtct 


gtgccaccct 


cgctctggtg 


gccaacgcca 


gtccagactg 


30840 


tattgggccc 


ttcgcctcct 


acgtgctctt 


tgccttcgtc 


acctgcatct 


getgetgtag 


30900 


catagtctgc 


ctgcttatca 


ccttcttcca 


gttcattgac 


tggatctttg 


tgcgcatcgc 


30960 


ctacctgcgc 


caccaccccc 


agtaccgcga 


ccagcgagtg 


gcgcagctgc 


tcaggctcct 


31020 


ctgataagca 


tgcgggctct 


gctacttctc 


gcgcttctgc 


tgttagtgct 


cccccgtccc 


31080 


gtcgaccccc 


ggtcccccac 


tcagtccccc 


gaggaggttc 


gcaaatgcaa 


attccaagaa 


31140 


ccctggaaat 


tcctcaaatg 


ctaccgccaa 


aaatcagaca 


tgcatcccag 


ctggatcatg 


31200 


atcattggga 


tcgtgaacat 


tctggcctgc 


accctcatct 


cctttgtgat 


ttacccctgc 


31260 


tttgactttg 


gttggaactc 


gccagaggcg 


ctctatctcc 


cgcctgaacc 


tgacacacca 


31320 


ccacagcagc 


aacctcaggc 


acacgcacta 


ccaccaccac 


agcctaggcc 


acaatacatg 


31380 


cccatattag 


actatgaggc 


cgagccacag 


cgacccatgc 


tccccgctat 


tagttacttc 


31440 


aatctaaccg 


gcggagatga 


ctgacccact 


ggccaataac 


aacgtcaacg 


accttctcct 


31500 


ggacatggac 


ggccgcgcct 


cggagcagcg 


actcgcccaa 


cttcgcattc 


gtcagcagca 


31560 


ggagagagcc 


gtcaaggagc 


tgcaggacgg 


catagccatc 


caccagtgca 


agagaggcat 


31620 


cttctgcctg 


gtgaaacagg 


ccaagatctc 


ctacgaggtc 


acccagaccg 


accatcgcct 


31680 


ctcctacgag 


ctcctgcagc 


agcgccagaa 


gttcacctgc 


ctggtcggag 


tcaaccccat 


31740 


cgtcatcacc 


cagcagtcgg 


gcgataccaa 


ggggtgcatc 


cactgctcct 


gcgactcccc 


31800 


cgactgcgtc 


cacactctga 


tcaagaccct 


ctgcggcctc 


cgcgacctcc 


tccccatgaa 


31860 


ctaatcaccc 


ccttatccag 


tgaaataaag 


atcatattga 


tgatgattta 


aataaaaaaa 


31920 


ataatcattt 


gatttgaaat 


aaagatacaa 


tcatattgat 


gatttgagtt 


taacaaaaat 


31980 


aaagaatcac 


ttacttgaaa 


tctgatacca 


ggtctctgtc 


catgttttct 


gccaacacca 


32040 


cctcactccc 


ctcttcccag 


ctctggtact 


gcaggccccg 


gcgggctgca 


aacttcctcc 


32100 


acacgctgaa 


ggggatgtca 


aattcctcct 


gtccctcaat 


cttcatttta 


tcttctatca 


32160 


gatgtccaaa 


aagcgcgtcc 


gggtggatga 


tgacttcgac 


cccgtctacc 


cctacgatgc 


32220 


agacaacgca 


ccgaccgtgc 


ccttcatcaa 


cccccccttc 


gtctcttcag 


atggattcca 


32280 


agagaagccc 


ctgggggtgt 


tgtccctgcg 


actggctgac 


cccgt caeca 


ecaagaaegg 


32340 


ggaaatcacc 


ctcaagctgg 


gagagggggt 


ggacctcgac 


tegtegggaa 


aactcatctc 


32400 


caacacggcc 


accaaggccg 


ccgcccctct 


cagtatttca 


aacaacacca 


tttcccttaa 


32460 


aactgctgcc 


cctttctaca 


acaacaatgg 


aactttaagc 


ctcaatgtct 


ccacaccatt 


32520 


agcagtattt 


cccacattta 


acactttagg 


cataagtctt 


ggaaacggtc 


ttcagacttc 


32580 


aaataagttg 


ttgactgtac 


aactaactca 


tcctcttaca 


ttcagctcaa 


atagcatcac 


32640 


agtaaaaaca 


gacaaagggc 


tatatattaa 


ctccagtgga 


aacagaggac 


ttgaggctaa 


32700 


tataagccta 


aaaagaggac 


tagtttttga 


cggtaatgct 


attgeaacat 


atattggaaa 


32760 


tggcttagac 


tatggatctt 


atgatagtga 


tggaaaaaca agacccgtaa 
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tggagcagga ttaaattttg 


atgctaacaa 


agcaatagct 


gtcaaactag 


gcacaggttt 


32880 


aagttttgac tccgctggtg 


ccttgacagc 


tggaaacaaa 


caggatgaca 


agctaacact 


32940 


ttggactacc cctgacccaa 


gccctaattg 


tcaattactt 


tcagacagag 


atgccaaatt 


33000 


tactctctgt cttacaaaat 


gcggtagtca 


aatactaggc 


actgtggcag 


tggcggctgt 


33060 


tactgtagga tcagcactaa 


atccaattaa 


tgacacagtc 


aaaagcgcca 


tagttttcct 


33120 


tagatttgat tccgatggtg 


tactcatgtc 


aaactcatca 


atggtaggtg 


attactggaa 


33180 


ctttagggag ggacagacca 


ctcaaagtgt 


agcctataca 


aatgctgtgg 


gattcatgcc 


33240 


aaatataggt gcatatccaa 


aaacccaaag 


taaaacacct 


aaaaatagca 


tagtcagtca 


33300 


ggtatattta actggagaaa 


ctactatgcc 


aatgacacta 


accataactt 


tcaatggcac 


33360 


tgatgaaaaa gacacaaccc 


cagttagcac 


ctactctatg 


acttttacat 


ggcagtggac 


33420 


tggagactat aaggacaaaa 


atattacctt 


tgctaccaac 


tcattctctt 


tttcctacat 


33480 


cgcccaggaa taatcccacc 


cagcaagcca 


accccttttc 


ccaccacctt 


tgtctatatg 


33540 


gaaactctga aacagaaaaa 


taaagttcaa 


gtgttttatt 


gaatcaacag 


ttttacagga 


33600 


ctcgagcagt tatttttcct 


ccaccctccc 


aggacatgga 


atacaccacc 


ctctcccccc 


33660 


gcacagcctt gaacatctga 


atgccattgg 


tgatggacat 


gcttttggtc 


tccacgttcc 


33720 


acacagtttc agagcgagcc 


agtctcggat 


cggtcaggga 


gatgaaaccc 


tccgggcact 


33780 


cccgcatctg cacctcacag 


ctcaacagct 


gaggattgtc 


ctcggtggtc 


gggatcacgg 


33840 


ttatctggaa gaagcagaag 


agcggcggtg 


ggaatcatag 


tccgcgaacg 


ggatcggccg 


33900 


gtggtgtcgc atcaggcccc 


gcagcagtcg 


ctgccgccgc 


cgctccgtca 


agctgctgct 


33960 


cagggggttc gggtccaggg 


actccctcag 


catgatgccc 


acggccctca 


gcatcagtcg 


34020 


tctggtgcgg cgggcgcagc 


agcgcatgcg 


aatctcgctc 


aggtcactgc 


agtacgtgca 


34080 


acacaggacc accaggttgt 


tcaacagtcc 


atagttcaac 


acgctccagc 


cgaaactcat 


34140 


cgcgggaagg atgctaccca 


cgtggccgtc 


gtaccagatc 


ctcaggtaaa 


tcaagtggcg 


34200 


ctccctccag aagacgctgc 


ccatgtacat 


gatctccttg 


ggcatgtggc 


ggttcaccac 


34260 


ctcccggtac cacatcaccc 


tctggttgaa 


catgcagccc 


cggatgatcc 


tgcggaacca 


34320 


cagggccagc accgccccgc 


ccgccatgca 


gcgaagagac 


cccggatccc 


ggcaatgaca 


34380 


atggaggacc caccgctcgt 


acccgtggat 


catctgggag 


ctgaacaagt 


ctatgttggc 


34440 


acagcacagg catatgctca 


tgcatctctt 


cagcactctc 


agctcctcgg 


gggtcaaaac 


34500 


catatcccag ggcacgggga 


actcttgcag 


gacagcgaac 


cccgcagaac 


agggcaatcc 


34560 


tcgcacataa cttacattgt 


gcatggacag 


ggtatcgcaa 


tcaggcagca 


ccgggtgatc 


34620 


ctccaccaga gaagcgcggg 


tctcggtctc 


ctcacagcgt 


ggtaaggggg 


ccggccgata 


34680 


cgggtgatgg cgggacgcgg 


ctgatcgtgt 


tctcgaccgt 


gtcatgatgc 


agttgctttc 


34740 


ggacattttc gtacttgctg 


tagcagaacc 


tggtccgggc 


gctgcacacc 


gatcgccggc 


34800 


ggcggtctcg gcgcttggaa 


cgctcggtgt 


taaagttgta 


aaacagccac 


tctctcagac 


34860 
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lagatccc atcatgcctg atagctctga 34920 

igccagat gatgcaattt tgttgggttt 34980 

accatgat taacttttaa tccaaacggt 35040 

taacacct ctcqcccccg ctgtgttqqt 35100 



Ly LyLdyLdy 




LLayyay Lya 


LLdLdLLydL 


LdLLy Lyy dd 


LyyyLLayyL 


Lyy LydLyyL 


yyyyydyyyd 


ayacxv^ayycLCL 


c ucyydy LdL. 


ttraaaatna 
LlLdddd uyd 


dyy LLdLyy d 


ygdaaaladC 


anrrannf ra 
dy LLdyy Ltd 


aanntnatar 
ddyy LydLdL 




r*a ffifflfa i~a 
LdLy Ly LdLa 


trranaaa ra 


dLLLCLLdaL 


LdLLd Ly L v.d 


ra r1"rrl"Cira 

LdL LLL Ly Lu 


rttnaatnat 
V- L Lydd uyd L 


trnaartant 
LLyddL Lay l 


trrtnannta 
lll Ly ay y La 


nrananrarr 
y v-dydy lcill 


L LLLaLLyyL 


a l ll l Laayv. 




tnranranat 

LyL.dyL.dyd l 


tnaraanrnn 
Ly dLddy Lyy 


L LLLLLLL LL 


anraataart 
dyLddLddLL 


n 1* a a n t" a r 1" c 
y Lddy LdL ll 


art/ia/T*r*^^a 
dyydLLLLLd 


y y dd Lddy dy 


aannnraanr 
ddy y yLday l 


y uyaycat ty 


LLddd Ly Ldd 


nattnaaata 
y a l Ly aaa La 


ranataartn 

Ldyd LdaL uy 


naranaaaat 
ydLdydddd l 


i.yyy L.o.ciy L.a 


ttrranntnr 

LLLLdyy ty l 


arntftannn 
dL.y l l Ldy yy 


rr"l"<"nnnaar 
ll LLyyy aaL 


cdycdtyy l l 


anftanrtna 

dy l Ldy u Lyd 


trtntaaaaa 
LL Ly Lddddd 


c r"f~ rtn/"" na 3 I'- 
ll Lyy tyddL 


a nni*rifin+"a a 

agy ugyyi-dd 


a t* r n"M" r t* rt* 

dLLyLLLLLL 


Ly LydLLL LL 


ntaaaaattfl 
y Lddddd L Ly 


1*rnr1"a1"na "t" 
LLy l La Ly a l 


ggccggcgng 


a atnattrna 
dd Lyd L LL.y d 


naanaanrat 
yddyaayLa l 


cicicLciciciciy Ly 


y LL.y ayy aay 




LLjd Ly LLd Ly 


rnnatnaanr 

L>y y d Ly day L, 


a raaaat*1"t"t" 


^f"1*npaf ann 


ranrnaanrt 
LdyLyaayL L 


cc c n a t* c cc 1" 


L ay Q. L L a l l y 


dy L,yy Ldy L>d 


y Ldy Lyy LdL 


ydyC LL Laav. 


L Ly LLLyLLL 


y L L L L L Ly L L 


aa an/irra 3 ta 
dddyy LLddd 


ntrtaaa aat 
y LL Lddddd L 


acccr\cc7k a a 
aLLLy LLdda 


dLLyy LydLd 


rartranaaa 
LdL LLdy ddd 


aat" a cncnczt 
dd LdLyLy La 




dLy L LdLy LL 


atraaaarar 
a LLaaaaLaL 


atcacccgcc 


ccgcccctaa 


cggtcgccgc 


aaattcaaac 


agctcatttg 


catattaacg 


gatg 







atccccag ataattttca tttttccagc 35280 

tccaagcc agccatgata aaaagctcgc 35340 

accctcat aattccaaga tattctgctc 35400 

tatcaaaa tctctgccgc gatccctgag 35460 

tcatatcg tctccgaaat ttttagccat 35520 

cattacag ataaaccgaa gtccccccca 35580 

catgctgg ctagacccgg tgatatcttc 35640 

ttttaaga aaatcaacaa aagaaaaatc 35700 

cgatggag taagtgcaag gggtgcgttc 35760 

caaaaaat aaaacattaa accatgctag 35820 

agcaccag gcaggccacg gggtctccgg 35880 

aaaaccat cacagagaga cgttcccggt 35940 

acccccgg aacattggag tccgtgagtg 36000 

acaacgct cactctcaag tccagcaaag 36060 

ggtgcgta aaaaatgtaa ttactcccct 36120 

agatacac atacaaagcc tcagcgtcca 36180 

aacaggcg caagagtcag agaaaagact 36240 

atatatag ccccagatct acactgacgt 36300 

atcacaca cgcccagcac acgcccagaa 36360 

tcctcaaa cggccaaact gccgtcattt 36420 

ctttcaaa ttccgtcgac cgttaaaaac 36480 

ccgcagcc aatcaccttc ctccctcccc 36540 

caccaaaa gtttgaggta tattattgat 36600 

36604 

<210> 3 
<211> 36535 
<212> DNA 

<213> chimpanzee adenovirus serotype Pan7 
<400> 3 

catcatcaat aatatacctc aaacttttgg tgcgcgttaa tatgcaaatg agctgtttga 60 

Page 36 



WO 2005/001103 PCTYUS2004/016614 

atttggggag ggaggaaggt gattggccga gagacgggcg accgttaggg gcggggcggg 120 

tgacgttttt aatacgtggc cgtgaggcgg agccggtttg caagttctcg tgggaaaagt 180 

gacgtcaaac gaggtgtggt ttgaacacgg aaatactcaa ttttcccgcg ctctctgaca 240 

ggaaatgagg tgtttctggg cggatgcaag tgaaaacggg ccattttcgc gcgaaaactg 300 

aatgaggaag tgaaaatctg agtaatttcg cgtttatggc agggaggagt atttgccgag 360 

ggccgagtag actttgaccg attacgtggg ggtttcgatt accgtatttt tcacctaaat 420 

ttccgcgtac ggtgtcaaag tccggtgttt ttacgtaggc gtcagctgat cgccagggta 480 

tttaaacctg cgctctctag tcaagaggcc actcttgagt gccagcgagt agagttttct 540 

cctccgcgcc gcgagtcaga tctacacttt gaaagatgag gcacctgaga gacctgcccg 600 

gtaatgtttt cctggctact gggaacgaga ttctggaatt ggtggtggac gccatgatgg 660 

gtggcgaccc tcctgagccc cctaccccat ttgaggcgcc ttcgctgtac gatttgtatg 720 

atctggaggt ggatgtgccc gagaacgacc ccaacgagga ggcggtgaat gatttgttta 780 

gcgatgccgc gctgctggct gccgagcagg ctaatacgga ctctggctca gacagcgatt 840 

cctctctcca taccccgaga cccggcagag gtgagaaaaa gatccccgag cttaaagggg 900 

aagagctcga cctgcgctgc tatgaggaat gcttgcctcc gagcgatgat gaggaggacg 960 

aggaggcgat tcgagctgca tcgaaccagg gagtgaaagc tgcgggcgaa agctttagcc 1020 

tggactgtcc tactctgccc ggacacggct gtaagtcttg tgaatttcat cgcatgaata 1080 

ctggagataa gaatgtgatg tgtgccctgt gctatatgag agcttacaac cattgtgttt 1140 

acagtaagtg tgattaactt tagttgggaa ggcagagggt gactgggtgc tgactggttt 1200 

atttatgtat atgttttttt atgtgtaggt cccgtctctg acgtagatga gacccccact 1260 

tcagagtgca tttcatcacc cccagaaatt ggcgaggaac cgcccgaaga tattattcat 1320 

agaccagttg cagtgagagt caccgggcgg agagcagctg tggagagttt ggatgacttg 1380 

ctacagggtg gggatgaacc tttggacttg tgtacccgga aacgccccag gcactaagtg 1440 

ccacacatgt gtgtttactt aaggtgatgt cagtatttat agggtgtgga gtgcaataaa 1500 

atccgtgttg actttaagtg cgtggtttat gactcagggg tggggactgt gggtatataa 1560 

gcaggtgcag acctgtgtgg tcagttcaga gcaggactca tggagatctg gacggtcttg 1620 

gaagactttc accagactag acagctgcta gagaactcat cggagggggt ctcttacctg 1680 

tggagattct gcttcggtgg gcctctagct aagctagtct atagggccaa acaggattat 1740 

aaggatcaat ttgaggatat tttgagagag tgtcctggta tttttgactc tctcaacttg 1800 

ggccatcagt ctcactttaa ccagagtatt ctgagagccc ttgacttttc tactcctggc 1860 

agaactaccg ccgcggtagc cttttttgcc tttatccttg acaaatggag tcaagaaacc 1920 

catttcagca gggattaccg tctggactgc ttagcagtag ctttgtggag aacatggagg 1980 

tgccagcgcc tgaatgcaat ctccggctac ttgccagtac agccggtaga cacgctgagg 2040 

atcctgagtc tccagtcacc ccaggaacac caacgccgcc agcagccgca gcaggagcag 2100 

cagcaagagg aggaggagga tcgagaagag aacccgagag ccggtctgga ccctccggtg 2160 
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gcggaggagg 


aggagtagct 


gacttgtttc 


ccgagctgcg 


ccgggtgctg 


actaggtctt 


2220 


ccagtggacg 


ggagaggggg 


attaagcggg 


agaggcatga 


ggagactagc 


cacagaactg 


2280 


aactgactgt 


cagtctgatg 


agccgcaggc 


gcccagaatc 


ggtgtggtgg 


catgaggttc 


2340 


agtcgcaggg 


gatagatgag 


gtctcggtga 


tgcatgagaa 


atattccctg 


gaacaagtca 


2400 


agacttgttg 


gttggagcct 


gaggatgatt 


gggaggtagc 


catcaggaat 


tatgccaagc 


2460 


tggctctgaa 


gccagacaag 


aagtacaaga 


ttaccaaact 


gattaatatc 


agaaattcct 


2520 


gctacatttc 


agggaatggg 


gccgaggtgg 


agatcagtac 


ccaggagagg 


gtggccttca 


2580 


gatgttgtat 


gatgaatatg 


tacccggggg 


tggtgggcat 


ggagggagtc 


acctttatga 


2640 


acgcgaggtt 


caggggtgat 


gggtataatg 


gggtggtctt 


tatggccaac 


accaagctga 


2700 


cagtgcacgg 


atgctccttc 


tttgggttca 


ataacatgtg 


catcgaggcc 


tggggcagtg 


2760 


tttcagtgag 


gggatgcagc 


ttttcagcca 


actggatggg 


ggtcgtgggc 


agaaccaaga 


2820 


gcaaggtgtc 


agtgaagaaa 


tgcctgttcg 


agaggtgcca 


cctgggggtg 


atgagcgagg 


2880 


gcgaagccaa 


agtcaaacac 


tgcgcctcta 


ctgagacggg 


ctgctttgtg 


ctgatcaagg 


2940 


gcaatgccca 


agtcaagcat 


aacatgatct 


gtggggcctc 


ggatgagcgc 


ggctaccaga 


3000 


tgctgacctg 


cgccggtggg 


aacagccata 


tgctggccac 


cgtgcatgtg 


acctcgcacc 


3060 


cccgcaagac 


atggcccgag 


ttcgagcaca 


acgtcatgac 


ccgatgcaat 


gtgcacctgg 


3120 


ggtcccgccg 


aggcatgttc 


atgccctacc 


agtgcaacat 


gcaatttgtg 


aaggtgctgc 


3180 


tggagcccga 


tgccatgtcc 


agagtgagcc 


tgacgggggt 


gtttgacatg 


aatgtggagc 


3240 


tgtggaaaat 


tctgagatat 


gatgaatcca 


agaccaggtg 


ccgggcctgc 


gaatgcggag 


3300 


gcaagcacgc 


caggcttcag 


cccgtgtgtg 


tggaggtgac 


ggaggacctg 


cgacccgatc 


3360 


atttggtgtt 


gtcctgcaac 


gggacggagt 


tcggctccag 


cggggaagaa 


tctgactaga 


3420 


gtgagtagtg 


tttgggggag 


gtggagggct 


tgtatgaggg 


gcagaatgac 


taaaatctgt 


3480 


gtttttctgt 


gtgttgcagc 


agcatgagcg 


gaagcgcctc 


ctttgaggga 


ggggtattca 


3540 


gcccttatct 


gacggggcgt 


ctcccctcct 


gggcgggagt 


gcgtcagaat 


gtgatgggat 


3600 


ccacggtgga 


cggccggccc 


gtgcagcccg 


cgaactcttc 


aaccctgacc 


tacgcgaccc 


3660 


tgagctcctc 


gtccgtggac 


gcagctgccg 


ccgcagctgc 


tgcttccgcc 


gccagcgccg 


3720 


tgcgcggaat 


ggccctgggc 


gccggctact 


acagctctct 


ggtggccaac 


tcgacttcca 


3780 


ccaataatcc 


cgccagcctg 


aacgaggaga 


agctgctgct 


gctgatggcc 


cagctcgagg 


3840 


ccctgaccca 


gcgcctgggc 


gagctgaccc 


agcaggtggc 


tcagctgcag 


gcggagacgc 


3900 


gggccgcggt 


tgccacggtg 


aaaaccaaat 


aaaaaatgaa 


tcaataaata 


aacggagacg 


3960 


gttgttgatt 


ttaacacaga 


gtcttgaatc 


tttatttgat 


ttttcgcgcg 


cggtaggccc 


4020 


tggaccaccg 


gtctcgatca 


ttgagcaccc 


ggtggatttt 


ttccaggacc 


cggtagaggt 


4080 


gggcttggat 


gttgaggtac 


atgggcatga 


gcccgtcccg 


ggggtggagg 


tagctccatt 


4140 


gcagggcctc 


gtgctcgggg 


gtggtgttgt 


aaatcaccca 


gtcatagcag 


gggcgcaggg 


4200 
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cgtqgtqctg 


cacgatgtcc 


ttgaggagga 


gactgatggc 


cacgggcagc 


cccttggtgt 


4260 


aggtgttgac 

33 3 3 


gaacctgttg 


agctgggagg 


gatgcatgcg 


gggggagatg 


agatgcatct 


4320 


tggcctggat 

ZJ ZJ 33 


cttgagattg 


gcgatgttcc 


cgcccagatc 


ccgccggggg 


ttcatgttgt 


4380 


gcaggaccac 


cagcacggtg 

3 *-/ ZJ 


tatccggtgc 


acttggggaa 


tttgtcatgc 


aacttggaag 


4440 


ggaaggcgtg 

33 33 3 3 


aaagaatttg 


gagacgccct 


tgtgaccgcc 


caggttttcc 


atgcactcat 


4500 


ccatgatgat 


ggcgatgggc 

33 3 333 


ccgtgggcgg 


cggcctgggc 


aaagacgttt 


cgggggtcgg 


4560 


acacatcgta 


gttgtggtcc 


tgggtgagct 


cgtcataggc 


cattttaatg 


aatttggggc 


4620 


ggagggtgcc 


cgactggggg 


acgaaggtgc 


cctcgatccc 


gggggcgtag 


ttgccctcgc 


4680 


agatctgcat 


ctcccaggcc 


ttgagctcgg 


agggggggat 


catgtccacc 


tgcggggcga 


4740 


tgaaaaaaac 


ggtttccggg 


gcgggggaga 


tgagctgggc 


cgaaagcagg 


ttccggagca 


4800 


gctgggactt 


gccgcagccg 


gtggggccgt 


agatgacccc 


gatgaccggc 


tgcaggtggt 


4860 


agttgaggga 

3 3 33 3 


gagacagctg 


ccgtcctcgc 


ggaggagggg 


ggccacctcg 


ttcatcatct 


4920 


cgcgcacatg 


catgttctcg 


cgcacgagtt 


ccgccaggag 


gcgctcgccc 


cccagcgaga 


4980 


ggagctcttg 


cagcgaggcg 


aagtttttca 


gcggcttgag 


yccgtcggcc 


atgggcattt 


5040 


tggagagggt 


ctgttgcaag 


agttccagac 


ggtcccagag 


ctcggtgatg 


tgctctaggg 


5100 


catctcgatc 


cagcagacct 


cctcgtttcg 


cgggttgggg 


cgactgcggg 


agtagggcac 


5160 


caggcgatgg 

3 3 3 33 


gcgtccagcg 


aggccagggt 


ccggtccttc 


cagggtcgca 


gggtccgcgt 


5220 


cagcgtggtc 


tccgtcacgg 


tgaaggggtg 


cgcgccgggc 


tgggcgcttg 


cgagggtgcg 


5280 


cttcaggctc 


atccggctgg 


tcgagaaccg 


ctcccggtcg 


gcgccctgcg 


cgtcggccag 


5340 


gtagcaattg 


agcatgagtt 


cgtagttgag 


cgcctcggcc 


gcgtggccct 


tggcgcggag 


5400 


cttacctttg 


gaagtgtgtc 


cgcagacggg 


acagaggagg 


gacttgaggg 


cgtagagctt 


5460 


gggggcgagg 

ZJ ^ ^ ZJ Zj 3 33 


aagacggact 

3 3 3 


cgggggcgta 


ggcgtccgcg 


ccgcagctgg 


cgcagacggt 


5520 


ctcgcactcc 


acgagccagg 


tgaggtcggg 


ccggttgggg 


tcaaaaacga 


ggtttcctcc 


5580 


gtgctttttg 


atgcgtttct 


tacctctggt 


ctccatgagc 


tcgtgtcccc 


gctgggtgac 


5640 


aaagaggctg 


tccgtgtccc 


cgtagaccga 


ctttatgggc 


cggtcctcga 


gcggggtgcc 


5700 


gcggtcctcg 


tcgtagagga 


accccgccca 


ctccgagacg 


aaggcccggg 


tccaggccag 


5760 


cacgaaggag 


gccacgtggg 


aggggtagcg 


gtcgttgtcc 


accagcgggt 


ccaccttctc 


5820 


cagggtatgc 


aagcacatgt 


ccccctcgtc 


cacatccagg 


aaggtgattg 


gcttgtaagt 


5880 


gtaggccacg 


tgaccggggg 


tcccggccgg 


gggggtataa 


aagggggcgg 


gcccctgctc 


5940 


gtcctcactg 


tcttccggat 


cgctgtccag 


gagcgccagc 


tgttggggta 


ggtattccct 


6000 


ctcgaaggct 


ggcataacct 


cggcactcag 


gttgtcagtt 


tctagaaacg 


aggaggattt 


6060 


gatattgacg 


gtgccgttgg 


agacgccttt 


catgagcccc 


tcgtccatct 


ggtcagaaaa 


6120 


gacgatcttt 


ttgttgtcga 


gcttggtggc 


gaaggagccg 


tagagggcgt 


tggagaggag 


6180 


cttggcgatg 


gagcgcatgg 


tctggttctt 


ttccttgtcg 


gcgcgctcct 


tggcggcgat 


6240 


gttgagctgc 


acgtactcgc 


gcgccacgca 


cttccattcg gggaagacgg 
Page 39 


tggtgagctc 


6300 



WO 2005/001103 



PCT/US2004/016614 



gtcgggcacg 


attctgaccc 


gccagccgcg 


gttgtgcagg 


gtgatgaggt 


ccacgctggt 


6360 


ggccacctcg 


ccgcgcaggg 


gctcgttggt 


ccagcagagg 


cgcccgccct 


tgcgcgagca 


6420 


gaaggggggc 


agcgggtcca 


gcatgagctc 


gtcggggggg 


tcggcgtcca 


cggtgaagat 


6480 


gccgggcaga 


agctcggggt 


cgaagtagct 


gatgcaggtg 


tccagatcgt 


ccagcgccgc 


6540 


ttgccagtcg 


cgcacggcca 


gcgcgcgctc 


gtaggggctg 


aggggcgtgc 


cccagggcat 


6600 


ggggtgcgtg 


agcgcggagg 


cgtacatgcc 


gcagatgtcg 


tagacgtaga 


ggggctcctc 


6660 


gaggacgccg 


atgtaggtgg 


ggtagcagcg 


ccccccgcgg 


atgctggcgc 


gcacgtagtc 


6720 


gtacagctcg 


tgcgagggcg 


cgaggagccc 


cgtgccgagg 


ttggagcgtt 


gcggcttttc 


6780 


ggcgcggtag 


acgatctggc 


ggaagatggc 


gtgggagttg 


gaggagatgg 


tgggcctctg 


6840 


gaagatgttg 


aagtgggcgt 


ggggcaggcc 


gaccgagtcc 


ctgatgaagt 


gggcgtagga 


6900 


gtcctgcagc 


ttggcgacga 


gctcggcggt 


gacgaggacg 


tccagggcgc 


agtagtcgag 


6960 


ggtctcttgg 


atgatgtcgt 


acttgagctg 


gcccttctgc 


ttccacagct 


cgcggttgag 


7020 


aaggaactct 


tcgcggtcct 


tccagtactc 


ttcgaggggg 


aacccgtcct 


gatcggcacg 


7080 


gtaagagccc 


accatgtaga 


actggttgac 


ggccttgtag 


gcgcagcagc 


ccttctccac 


7140 


ggggagggcg 


taagcttgtg 


cggccttgcg 


cagggaggtg 


tgggtgaggg 


cgaaggtgtc 


7200 


gcgcaccatg 


accttgagga 


actggtgctt 


gaagtcgagg 


tcgtcgcagc 


cgccctgctc 


7260 


ccagagctgg 


aagtccgtgc 


gcttcttgta 


ggcggggttg 


ggcaaagcga 


aagtaacatc 


7320 


gttgaagagg 


atcttgcccg 


cgcggggcat 


gaagttgcga 


gtgatgcgga 


aaggctgggg 


7380 


cacctcggcc 


cggttgttga 


tgacctgggc 


ggcgaggacg 


atctcgtcga 


agccgttgat 


7440 


gttgtgcccg 


acgatgtaga 


gttccacgaa 


tcgcgggcgg 


cccttaacgt 


ggggcagctt 


7500 


cttgagctcg 


tcgtaggtga 


gctcggcggg 


gtcgctgagc 


ccgtgctgct 


cgagggccca 


7560 


gtcggcgacg 


tgggggttgg 


cgctgaggaa 


ggaagtccag 


agatccacgg 


ccagggcggt 


7620 


ctgcaagcgg 


tcccggtact 


gacggaactg 


ctggcccacg 


gccatttttt 


cgggggtgac 


7680 


gcagtagaag 


gtgcgggggt 


cgccgtgcca 


gcggtcccac 


ttgagctgga 


gggcgaggtc 


7740 


gtgggcgagc 


tcgacgagcg 


gcgggtcccc 


ggagagtttc 


atgaccagca 


tgaaggggac 


7800 


gagctgcttg 


ccgaaggacc 


ccatccaggt 


gtaggtttcc 


acatcgtagg 


tgaggaagag 


7860 


cctttcggtg 


cgaggatgcg 


agccgatggg 


gaagaactgg 


atctcctgcc 


accagttgga 


7920 


ggaatggctg 


ttgatgtgat 


ggaagtagaa 


atgccgacgg 


cgcgccgagc 


actcgtgctt 


7980 


gtgtttatac 


aagcgtccgc 


agtgctcgca 


acgctgcacg 


ggatgcacgt 


gctgcacgag 


8040 


ctgtacctgg 


gttcctttga 


cgaggaattt 


cagtgggcag 


tggagcgctg 


gcggctgcat 


8100 


ctggtgctgt 


actacgtcct 


ggccatcggc 


gtggccatcg 


tctgcctcga 


tggtggtcat 


8160 


gctgacgagc 


ccgcgcggga 


ggcaggtcca 


gacttcggct 


cggacgggtc 


ggagagcgag 


8220 


gacgagggcg 


cgcaggccgg 


agctgtccag 


ggtcctgaga 


cgctgcggag 


tcaggtcagt 


8280 


gggcagcggc 


ggcgcgcggt 


tgacttgcag 


gagcttttcc 


agggcgcgcg 


ggaggtccag 


8340 
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atggtacttg 


atctccacgg 


cgccgttggt 

*3 3 33 


ggcgacgtcc 

ZJ ZJ ZJ Z> 


acggcttgca 


gggtcccgtg 


8400 


cccctggggc 


gccaccaccg 


tgccccgttt 


cttcttgggc 


gctgcttcca 


tgccggtcag 


8460 


aagcggcggc 


gaggacgcgc 

ZJ ZJ ZJ Z? ZJ 


gccgggcggc 


aggggcggct 


cgggacccgg 


aggcaggggc 


8520 


ggcaggggca 


cgtcggcgcc 
3 33 ^3 


gcgcgcgggc 

33 3 333 


aggttctggt 

33 33 


actgcgcccg 


gagaagactg 


8580 


gcgtgagcga 


cgacgcgacg 


gttgacgtcc 


tggatctgac 


gcctctgggt 


gaaggccacg 


8640 


ggacccgtga 


gtttgaacct 


gaaagagagt 

3 333 


tcgacagaat 

3 3 


caatctcggt 


atcgttgacg 


8700 


gcggcctgcc 

3 3 3 3 


gcaggatctc 


ttgcacgtcg 


cccgagttgt 


cctggtaggc 


gatctcggtc 


8760 


atgaactgct 


cgatctcctc 


ctcctgaagg 


tctccgcggc 


cggcgcgctc 


gacggtggcc 


8820 


gcgaggtcgt 


tggagatgcg 

ZJ ZJ ZJ 3 3 


gcccatgagc 


tgcgagaagg 


cgttcatgcc 


ggcctcgttc 


8880 


cagacgcggc 

3 3 33 


tgtagaccac 


ggctccgtcg 


gggtcgcgcg 
333 "■^•3^*3^3 


cgcgcatgac 


cacctgggcg 


8940 


aggttgagct 

3 3 ™ 3 3 


cqacgtggcg 
3 3 333 


cgtgaagacc 


gcgtagttgc 
3 3 ^3 


agaggcgctg 

ZJ ZJ ZJ ZJ ~3 


gtagaggtag 


9000 


ttgagcqtqg 

»"3 *-*3 ^-3 *- 3 3 


tqqcqatqtq 


ctcggtgacg 


aagaagtaca 


tgatccagcg 


gcgqagcggc 
333 3 33* 


9060 


atctcgctga 


cgtcgcccag 


ggcttccaag 


cgctccatgg 
3 33 


cctcgtagaa 


gtccacggcg 


9120 


aagttgaaaa 


actgqgaqtt 


qcqcgccqaq 


acggtcaact 


cctcctccag 


aagacggatg 

3 3 3 3 


9180 


agctcagcga 


tggtggcgcg 


cacctcgcgc 


tcgaaggccc 


cggggggctc 


ctcttcttcc 


9240 


atctcttcct 


cctccactaa 


catctcttct 


acttcctcct 


caggaggcgg 


cggcggggga 


9300 


ggggccctgc 

3333 3 


gtcgccggcg 

3 "•* , *3'' ,,, ' , '33 3 


gcgcacgggc 


agacggtcga 


tgaagcgctc 


gatggtctcc 


9360 


ccgcgccggc 

3'"3* ,, '33** 


gacgcatggt 


ctcggtgacg 


gcgcgcccgt 


cctcgcgggg 

"3 ^^^S 


ccgcagcgtg 

3 333 


9420 


aagacgccgc 


cgcgcatctc 

3 3 


caggtggccg 

33 33 3 


ccgggggggt 


ctccgttggg 


cagggagagg 


9480 


gcgctgacga 


tgcatcttat 


caattggccc 


gtagggactc 


cgcgcaagga 


cctgagcgtc 


9540 


tcgagatcca 


cgggatccga 

333 3 


aaaccgctga 

3 3 


acgaaggctt 


cgagccagtc 


gcagtcgcaa 


9600 


ggtaggctga 

33 33 3 


gcccggtttc 


ttgttcttcg 


gggatttcgg 


gaggcgggcg 

3 33 333 3 


ggcgatgctg 

333 3 3 


9660 


ctggtgatga 


agttgaagta 


ggcqgtcctg 

zj ~j ~j zj 3 


agacggcgga 


tggtggcgag 

33 33 3 3 


gagcaccagg 


9720 


tccttgggcc 


cggcttgctg 


gatgcgcaga 


cggtcggcca 
33 33 


tgccccaggc 


gtggtcctga 


9780 


cacctggcga 


ggtccttgta 


gtagtcctgc 


atgagccgct 


ccacgggcac 


ctcctcctcg 


9840 


cccgcgcggc 


cgtgcatgcg 


cgtgagcccg 


aacccgcgct 


ggggctggac 

j g 3 3 3 


gagcgccagg 


9900 


tcggcgacga 


cgcgctcggc 


gaggatggcc 


tgctgtatct 


gggtgagggt 

333 ^ 


ggtctggaag 

33 33 3 


9960 


tcgtcgaagt 


cgacgaagcg 


gtggtaggct 


ccggtgttga 


tggtatagga 


gcagttggcc 


10020 


atgacggacc 


agttgacggt 


ctggtggccg 

33 33 3 


ggtcgcacga 


gctcgtggta 

3 3 33 


cttgaggcgc 


10080 


gagtaggcgc 
3 3 33 3 


gcgtgtcgaa 


gatgtagtcg 


ttgcaggtgc 

^ ZJ 3 3 w 3 


gcacgaggta 
3 3 33 


ctggtatccg 


10140 


acgaggaagt 


gcggcggcgg 
3 33 33 33 


ctggcggtag 

zj zj zj zj 3 


agcggccatc 


gctcggtggc 

3 33 33 


gggggcgccg 

ZJ Zs ZJ ZJ ZJ 3 3 


10200 


ggcgcgaggt 


cctcgagcat 


gaggcggtgg 

33333 3 3 


tagccgtaga 


tgtacctgga 


catccaggtg 


10260 


atgccggcgg 


cggtggtgga 


OQcgcgcggg 


aac Lcgcgga 


cgcggtLCca 


gatg LLgcgc 




agcggcagga 


agtagttcat 


ggtggccgcg 


gtctggcccg 


tgaggcgcgc 


gcagtcgtgg 


10380 


atgctctaga 


catacgggca 


aaaacgaaag 


cggtcagcgg 


ctcgactccg 


tggcctggag 


10440 
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gctaagcgaa 


cgggttgggc 


tgcgcgtgta 


ccccggttcg 


aatctcgaat 


caggctggag 


10500 


ccgcagctaa 


cgtggtactg 


gcactcccgt 


ctcgacccaa 


gcctgctaac 


gaaacctcca 


10560 


ggatacggag 


gcgggtcgtt 


ttttggcctt 


ggtcgctggt 


catgaaaaac 


tagtaagcgc 


10620 


ggaaagcgac 


cgcccgcgat 


ggctcgctgc 


cgtagtctgg 


agaaagaatc 


gccagggttg 


10680 


cgttgcggtg 


tgccccggtt 


cgagcctcag 


cgctcggcgc 


cggccggatt 


ccgcggctaa 


10740 


cgtgggcgtg 


gctgccccgt 


cgtttccaag 


accccttagc 


cagccgactt 


ctccagttac 


10800 


ggagcgagcc 


cctctttttc 


ttgtgttttt 


gccagatgca 


tcccgtactg 


cggcagatgc 


10860 


gcccccaccc 


tccacctcaa 


ccgcccctac 


cgccgcagca 


gcagcaacag 


ccggcgcttc 


10920 


tgcccccgcc 


ccagcagcag 


ccagccacta 


ccgcggcggc 


cgccgtgagc 


ggagccggcg 


10980 


ttcagtatga 


cctggccttg 


gaagagggcg 


aggggctggc 


gcggctgggg 


gcgtcgtcgc 


11040 


cggagcggca 


cccgcgcgtg 


cagatgaaaa 


gggacgctcg 


cgaggcctac 


gtgcccaagc 


11100 


agaacctgtt 


cagagacagg 


agcggcgagg 


agcccgagga 


gatgcgcgcc 


tcccgcttcc 


11160 


acgcggggcg 


ggagctgcgg 


cgcggcctgg 


accgaaagcg 


ggtgctgagg 


gacgaggatt 


11220 


tcgaggcgga 


cgagctgacg 


gggatcagcc 


ccgcgcgcgc 


gcacgtggcc 


gcggccaacc 


11280 


tggtcacggc 


gtacgagcag 


accgtgaagg 


aggagagcaa 


cttccaaaaa 


tccttcaaca 


11340 


accacgtgcg 


cacgctgatc 


gcgcgcgagg 


aggtgaccct 


gggcctgatg 


cacctgtggg 


11400 


acctgctgga 


ggccatcgtg 


cagaacccca 


cgagcaagcc 


gctgacggcg 


cagctgtttc 


11460 


tggtggtgca 


gcacagtcgg 


gacaacgaga 


cgttcaggga 


ggcgctgctg 


aatatcaccg 


11520 


agcccgaggg 


ccgctggctc 


ctggacctgg 


tgaacattct 


gcagagcatc 


gtggtgcagg 


11580 


agcgcgggct 


gccgctgtcc 


gagaagctgg 


cggctatcaa 


cttctcggtg 


ctgagcctgg 


11640 


gcaagtacta 


cgctaggaag 


atctacaaga 


ccccgtacgt 


gcccatagac 


aaggaggtga 


11700 


agatcgacgg 


gttttacatg 


cgcatgaccc 


tgaaagtgct 


gaccctgagc 


gacgatctgg 


11760 


gggtgtaccg 


caacgacagg 


atgcaccgcg 


cggtgagcgc 


cagccgccgg 


cgcgagctga 


11820 


gcgaccagga 


gctgatgcac 


agcctgcagc 


gggccctgac 


cggggccggg 


accgaggggg 


11880 


agagctactt 


tgacatgggc 


gcggacctgc 


gctggcagcc 


cagccgccgg 


gccttggaag 


11940 


ctgccggcgg 


ttccccctac 


gtggaggagg 


tggacgatga 


ggaggaggag 


ggcgagtacc 


12000 


tggaagactg 


atggcgcgac 


cgtatttttg 


ctagatgcag 


caacagccac 


cgcctcctga 


12060 


tcccgcgatg 


cgggcggcgc 


tgcagagcca 


gccgtccggc 


attaactcct 


cggacgattg 


12120 


gacccaggcc 


atgcaacgca 


tcatggcgct 


gacgacccgc 


aatcccgaag 


cctttagaca 


12180 


gcagcctcag 


gccaaccggc 


tctcggccat 


cctggaggcc 


gtggtgccct 


cgcgctcgaa 


12240 


ccccacgcac 


gagaaggtgc 


tggccatcgt 


gaacgcgctg 


gtggagaaca 


aggccatccg 


12300 


cggcgacgag 


gccgggctgg 


tgtacaacgc 


gctgctggag 


cgcgtggccc 


gctacaacag 


12360 


caccaacgtg 


cagacgaacc 


tggaccgcat 


ggtgaccgac 


gtgcgcgagg 


cggtgtcgca 


12420 


gcgcgagcgg 


ttccaccgcg 


agtcgaacct 


gggctccatg 


gtggcgctga 


acgccttcct 


12480 
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aaacacacaa 


cccaccaacq 


tqccccqqqq 

*-53 x - x - v " v -335353 


ccaqqaqqac 


tacaccaact 


tcatcagcgc 


12540 


actacqqctq 


atqqtqqccq 

*-5353 5353^-^-53 


aggtgcccca 


qagcqaggtg 

53** 53 **53**5353 53 


taccagtcgg 


ggccggacta 


12600 


cttcttccag 


accagtcgcc 


agggcttgca 


gaccgtgaac 


ctgagccagg 


ctttcaagaa 


12660 


cttacaaaqa 


ctqtqqqqcq 

w w 53 53535353 '"53 


tqcaqqcccc 

w 23 23 *3 ^* ^* 


qgtcgqggac 

5353 53 53 53 53**'- 


cgcgcgacgg 


tgtcgagcct 


12720 


actqacqccq 


aactcgcgcc 


tgctgctgct 


qctqqtqqcq 

53^ ""533 **5353*~53 


cccttcacgg 


acagcggcag 


12780 


cataaaccac 


gactcgtacc 


tgggctacct 


gcttaacctg 


taccgcgagg 


ccatcgggca 


12840 


aacacacqtq 

yy v y 53 *-53 


qacqaqcaqa 


cctaccagga 


gatcacccac 


gtgagccgcg 

53 w 53**53*-^53*"3 


cgctgggcca 


12900 


aqaqqacccq 


qqcaacctqq 
yy *-5353 


aggccaccct 


gaacttcctg 


ctgaccaacc 


ggtcgcagaa 


12960 


gatcccgccc 


caqtacqcqc 

^***23 ***^^*77^^77 


tgagcaccga 


ggaggagcgc 


atcctgcgct 


acgtgcagca 


13020 


qaqcqtqqqq 

53**3 % ~3 3333 


ctgttcctga 


tqcaqqaggg 

*■ 53 *«* ** 53 53 ** 53 53 53 


qqccacgccc 


agcgccgcgc 


tcgacatgac 


13080 


cqcqcqcaac 


atqqaqccca 


gcatgtacgc 


tcgcaaccgc 


ccgttcatca 


ataagctgat 


13140 


ggactacttg 


catcqqqcqq 

^'** vv- 535353 v "5353 


ccgccatgaa 


ctcggactac 


tttaccaacg 


ccatcttgaa 


13200 


cccgcactgg 


ctcccgccgc 


ccgggttcta 


cacqqqcqaq 


tacgacatgc 


ccgaccccaa 


13260 


cqacqqqttc 


ctqtqqqacq 

^- w 53 *-5353 53** v '53 


acgtggacag 


caqcqtqttc 

^v^^23 ^* 2J 23 


tcgccgcgcc 


ccgccaccac 


13320 


cqtqtqqaaq 


aaagagggcg 


qqqaccqqcq 

535353 v * v - v *'5353 , ""53 


gccgtcctcg 


qcqctgtccq 

53^»53*-*»53 *--*-*-53 


qtcqcgcggg 

23*»*-53' , -23' , "»53 23 23 


13380 


tgctgccgcg 


qcqqtqcctq 

53^5353 *- 3 *"53 


aqqccqccaq 

2j 23 23 ^* 23 


ccccttcccg 


agcctgccct 


tttcgctgaa 


13440 


caqcqtqcqc 

<w».«2 ^"jp 33 53 


aqcaqcqaqc 

**3 2j| 53 ^^3 


tqqqtcqqct 

w 3 3 3 ** ^* 3 3 ** 


qacqcqqccq 

53^ 53 % "5353^"**53 


cgcctgctgg 


gcgagqagqa 

3 3 33**33" 


13500 


gtacctgaac 


gactccttgt 


tgaggcccga 


qcqcqaqaaq 

J3 23 53^^53 53 


aacttcccca 


ataacgggat 


13560 


agagagcctg 


gtggacaaga 


tqaqccqctq 

**53**53 w w 53 w 53 


qaaqacqtac 

3 3 3 


qcqcacqaqc 

53 53 *- ** *3 ** 53 *» 


acagggacga 


13620 


gccccgagct 


aqcaqcaqcq 


caggcacccg 


tagacgccag 


cgacacgaca 


qqcaqcqqqq 
53 53 w 53 *" 53 53 53 53 


13680 


tctqqtqtqq 

33 *-3 *-33 


qacqatqaqq 

23 23 23 23 73 


attccgccga 


cgacagcagc 


gtqttqgact 
53 3 *- »- 33 


tgggtgqgag 

*- 3 3 3 w 535353**53 


13740 


tgqtgqtqqt 

5353 5353 *"3 3 ** 


aacccgttcg 


ctcacttgcg 


cccccgtatc 


gggcgcctga 


tgtaagaatc 


13800 


tgaaaaaata 


aaaaacggta 


ctcaccaagg 


ccatggcgac 


cagcgtgcgt 

3^-3 *• 53^13' ** 


tcttctctgt 


13860 


tgtttgtagt 


agtatgatga 


qqcqcqtqta 

53 53 ^"53 ^"53 **53 


cccqqaqqqt 

533"*333 


cctcctccct 


cgtacgagag 

53 53 **23** 53 


13920 


cqtqatqcaq 

*"J3 53 53 53 


caqqcqqtqq 

vu yy^yy *-5353 


cqqcqqcqat 

v -33 x -353 v "3 , "* ,v * 


qcaqcccccq 

*3 53 53 


ctqqaqqcqc 

*-*353**53 53 w 53** 


cttacgtgcc 


13980 


cccqcqqtac 


ctggcgccta 


cqqaqqqqcq 

^-535353535353^*53 


gaacagcatt 


cqttactcqq 


aqctqqcacc 

«a 23 ^* 23 77 ^* 


14040 


cttgtacgat 


accacccggt 


tqtacctqqt 


ggacaacaag 


tcqqcqqaca 


tcqcctcqct 


14100 


gaactaccag 


aacgaccaca 


gcaacttcct 


gaccaccgtg 


gtgcagaaca 


acgatttcac 


14160 


ccccacggag 


gccagcaccc 


agaccatcaa 


ctttgacgag 


cgctcgcggt 


qqqqcggcca 

5353533 33 


14220 


gctgaaaacc 


atcatgcaca 


ccaacatgcc 


caacgtgaac 


qaqttcatqt 


acagcaacaa 


14280 


qttcaaqqcq 

23 ^* 23 23 


caqqtqatqq 

^- 53 53 53 **53 53 


tctcqcqcaa 

« w ^* 23 25 **** 


qacccccaat 


qqqqtcqcqq 
5353533 *-*-3 u yy 


tqqatqaqaa 

w 2J3 53 53 ^j**** 


14340 


ttatqatqqt 


aqtcaqqacq 


agctgactta 


cqaqtqqqtq 

L 3 a 3 L 335l *-53 


qaqtttqaqc 


tacccaaaaa 


14400 


caact tctcg 


gtgaccatga 


ccatcgatct 


gatgaacaac 


gccatcatcg 


acaactactt 


144 bU 


ggcggtgggg 


cgtcagaacg 


gggtgctgga 


gagcgacatc 


ggcgtgaagt 


tcgacacgcg 


14520 


caacttccgg 


ctgggctggg 


accccgtgac 


cgagctggtg 


atgccgggcg 


tgtacaccaa 


14580 
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cgaggccttc 


caccccgaca 


tcgtcctgct 


gcccggctgc 


ggcgtggact 


tcaccgagag 


14640 


ccgcctcagc 


aacctgctgg 


gcatccgcaa 


gcggcagccc 


ttccaggagg 


gcttccagat 


14700 


cctgtacgag 


gacctggagg 


ggggcaacat 


ccccgcgctc 


ttggatgtcg 


aagcctatga 


14760 


gaaaagcaag 


gaggaggccg 


ccgcagcggc 


gaccgcagcc 


gtggccaccg 


cctctaccga 


14820 


ggtgcggggc 


gataattttg 


ctagcgccgc 


ggcagtggcc 


gaggcggctg 


aaaccgaaag 


14880 


taagatagtc 


atccagccgg 


tggagaagga 


cagcaaggac 


aggagctaca 


acgtgctcgc 


14940 


ggacaagaaa 


aacaccgcct 


accgcagctg 


gtacctggcc 


tacaactacg 


gcgaccccga 


15000 


gaagggcgtg 


cgctcctgga 


cgctgctcac 


cacctcggac 


gtcacctgcg 


gcgtggagca 


15060 


agtctactgg 


tcgctgcccg 


acatgatgca 


agacccggtc 


accttccgct 


ccacgcgtca 


15120 


agttagcaac 


tacccggtgg 


tgggcgccga 


gctcctgccc 


gtctactcca 


agagcttctt 


15180 


caacgagcag 


gccgtctact 


cgcagcagct 


gcgcgccttc 


acctcgctca 


cgcacgtctt 


15240 


caaccgcttc 


cccgagaacc 


agatcctcgt 


ccgcccgccc 


gcgcccacca 


ttaccaccgt 


15300 


cagtgaaaac 


gttcctgctc 


tcacagatca 


cgggaccctg 


ccgctgcgca 


gcagtatccg 


15360 


gggagtccag 


cgcgtgaccg 


tcactgacgc 


cagacgccgc 


acctgcccct 


acgtctacaa 


15420 


ggccctgggc 


gtagtcgcgc 


cgcgcgtcct 


ctcgagccgc 


accttctaaa 


aaatgtccat 


15480 


tctcatctcg 


cccagtaata 


acaccggttg 


gggcctgcgc 


gcgcccagca 


agatgtacgg 


15540 


aggcgctcgc 


caacgctcca 


cgcaacaccc 


cgtgcgcgtg 


cgcgggcact 


tccgcgctcc 


15600 


ctggggcgcc 


ctcaagggcc 


gcgtgcgctc 


gcgcaccacc 


gtcgacgacg 


tgatcgacca 


15660 


ggtggtggcc 


gacgcgcgca 


actacacgcc 


cgccgccgcg 


cccgcctcca 


ccgtggacgc 


15720 


cgtcatcgac 


agcgtggtgg 


ccgatgcgcg 


ccggtacgcc 


cgcgccaaga 


gccggcggcg 


15780 


gcgcatcgcc 


cggcggcacc 


ggagcacccc 


cgccatgcgc 


gcggcgcgag 


ccttgctgcg 


15840 


cagggccagg 


cgcacgggac 


gcagggccat 


gctcagggcg 


gccagacgcg 


cggcctccgg 


15900 


cagcagcagc 


gccggcagga 


cccgcagacg 


cgcggccacg 


gcggcggcgg 


cggccatcgc 


15960 


cagcatgtcc 


cgcccgcggc 


gcggcaacgt 


gtactgggtg 


cgcgacgccg 


ccaccggtgt 


16020 


gcgcgtgccc 


gtgcgcaccc 


gcccccctcg 


cacttgaaga 


tgctgacttc 


gcgatgttga 


16080 


tgtgtcccag 


cggcgaggag 


gatgtccaag 


cgcaaataca 


aggaagagat 


gctccaggtc 


16140 


atcgcgcctg 


agatctacgg 


ccccgcggtg 


aaggaggaaa 


gaaagccccg 


caaactgaag 


16200 


cgggtcaaaa 


aggacaaaaa 


ggaggaggaa 


gatgtggacg 


gactggtgga 


gtttgtgcgc 


16260 


gagttcgccc 


cccggcggcg 


cgtgcagtgg 


cgcgggcgga 


aagtgaaacc 


ggtgctgcgg 


16320 


cccggcacca 


cggtggtctt 


cacgcccggc 


gagcgttccg 


gctccgcctc 


caagcgctcc 


16380 


tacgacgagg 


tgtacgggga 


cgaggacatc 


ctcgagcagg 


cggtcgagcg 


tctgggcgag 


16440 


tttgcttacg 


gcaagcgcag 


ccgccccgcg 


cccttgaaag 


aggaggcggt 


gtccatcccg 


16500 


ctggaccacg 


gcaaccccac 


gccgagcctg 


aagccggtga 


ccctgcagca 


ggtgctgccg 


16560 


agcgcggcgc 


cgcgccgggg 


cttcaagcgc 


gagggcggcg 


aggatctgta 


cccgaccatg 


16620 
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cagctgatgg 


tgcccaagcg 


ccagaagctg 


gaggacgtgc 

3 33 3 3 


tggagcacat 


gaaggtggac 


16680 


cccgaqgtqc 


agcccgaggt 


caaggtgcgg 

33 3 33 


cccatcaagc 


aggtggcccc 


gggcctgggc 


16740 


gtgcagaccg 


tggacatcaa 


gatccccacg 


gagcccatgg 

3 3 ~33 


aaacgcagac 


cgagcccgtg 


16800 


aagcccagca 


ccagcaccat 


ggaggtgcag 

3 3 


acggatccct 


ggatgccggc 


gccggcttcc 


16860 


accactcgcc 


gaagacgcaa 


gtacggcgcg 

9 "•~~33~3 3 


gccagcctgc 


tgatgcccaa 


ctacgcgctg 


16920 


catccttcca 


tcatccccac 


gccgggctac 


cgcqgcacgc 


gcttctaccg 


cggctacacc 


16980 


aqcagccgcc 


gcaagaccac 


cacccgccgc 


cgccgtcgtc 


gcacccgccg 


cagcagcacc 


17040 


gcgacttccg 


ccgccgccct 


ggtgcggaga 

33 3333 


gtgtaccgca 


gcgggcgcga 


gcctctgacc 


17100 


ctgccgcgcg 

3 9 3 3 


cgcgctacca 


cccgagcatc 


gccatttaac 


tctgccgtcg 


cctcctactt 


17160 


gcagatatgg 


ccctcacatg 


ccgcctccgc 


gtccccatta 

9 w — 


cgggctaccg 


aggaagaaag 


17220 


ccgcgccgta 


gaaggctgac 


ggggaacggg 


ctgcgtcgcc 


atcaccaccg 


gcggcggcgc 


17280 


gccatcagca 


aqcqgttqqq 


gggaqgcttc 


ctgcccgcgc 


tgatccccat 


catcgccgcg 


17340 


gcgatcgggg 


cgatccccgg 


catagcttcc 


gtggcggtgc 

9 3333 3 


aggcctctca 


gcgccactga 


17400 


gacacagctt 


ggaaaatttg 


taataaaaaa 


atggactgac 


gctcctggtc 


ctgtgatgtg 


17460 


tgtttttaga 


tggaagacat 


caatttttcg 


tccctggcac 


cgcgacacgg 


cacgcggccg 


17520 


tttatgggca 


cctggagcga 


catcggcaac 


agccaactga 

9 3 


acgggggcgc 

99999 9 


cttcaattgg 


17580 


agcagtctct 


ggagcgggct 


taagaatttc 


gggtccacgc 


tcaaaaccta 


tggcaacaag 


17640 


gcgtggaaca 


gcagcacagg 


gcaqgcgctg 

3 3 3 3 3 


agggaaaagc 


tgaaagagca 


gaacttccag 


17700 


cagaaggtgg 

3 3 3 3 


tcgatggcct 


ggcctcgggc 

99 33 3 


atcaacgggg 


tggtggacct 


ggccaaccag 


17760 


gccgtgcaga 


aacagatcaa 


cagccgcctg 


gacgcggtcc 


cqcccqcqqq 


qtccqtqqaq 

3 ^^ 3 3 3 3 


17820 


atgccccagg 


tggaggagga 

33 33 33 


gctgcctccc 


ctggacaagc 


gcggcgacaa 


gcgaccgcgt 


17880 


cccgacgcgg 


aggagacgct 

T *~33»--3'-»'»»3"— — 


gctgacgcac 


acggacgagc 


cgcccccgta 


cgaggaggcq 


17940 


gtgaaactgg 


gtctgcccac 


cacgcggccc 


gtggcgcctc 

3 33 3 


tggccaccgg 


ggtgctgaaa 


18000 


cccagcagca 


gcagccagcc 


cgcgaccctg 


gacttgcctc 


cgcctgcttc 


ccgcccctcc 


18060 


acagtggcta 


agcccctgcc 


gccggtggcc 


gtcgcgtcgc 

3 3~3 3 


gcgccccccg 


aggccgcccc 


18120 


caggcgaact 


ggcagagcac 


tctgaacagc 


atcgtgggtc 

3 **3 3 3 


tgggagtgca 


gaqtgtqaaq 

3 3 3 fc 3****3 


18180 


cgccgccgct 


gctattaaaa 


gacactgtag 


cgcttaactt 


gcttgtctgt 


gtgtatatgt 


18240 


atgtccgccg 


accagaagga 


ggaagaggcg 

9 9 3 3 3 3 


cgtcgccgag 

3 3 3 3 


ttgcaagatg 


gccaccccat 


18300 


cgatgctgcc 


ccagtgggcg 

3 9 39 9 


tacatgcaca 


tcgccggaca 


ggacgcttcg 


gagtacctga 


18360 


gtccgggtct 

9 9 9 


ggtgcagttc 


gcccgcgcca 


cagacaccta 


cttcagtctg 


gggaacaagt 


18420 


ttaggaaccc 


cacggtggcg 

93 33 3 


cccacgcacg 


atgtgaccac 


cgaccgcagc 


caqcqqctqa 

3 3 3 w 3** 


18480 


cgctgcgctt 


cgtgcccgtg 

3 3 3 "*3 


gaccgcgaqq 

3 9 3 3 3 


acaacaccta 


ctcgtacaaa 


qtqcqctaca 


18540 


cgccggccgL 


gggcgacaac 


cgcgrgctgg 


acatggccag 


cacc tac t t l 


gacatccgcg 


looUU 


gcgtgctgga 


tcgggggccc 


agcttcaaac 


cctactccgg 


caccgcctac 


aacagcctgg 


18660 


ctcccaaggg 


agcgcccaac 


acttgccagt 


ggacatataa 


agctggtgat 


actgatacag 


18720 
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aaaaaaccta 


tacatatgga 


aatgcacctg 


tgcaaggcat 


tagcattaca 


aaggatggta 


18780 


ttcaacttgg 


aactgacagc 


gatggtcagg 


caatctatgc 


agacgaaact 


tatcaaccag 


18840 


agcctcaagt 


gggtgatgct 


gaatggcatg 


acatcactgg 


tactgatgaa 


aaatatggag 


18900 


gcagagctct 


taagcctgac 


accaaaatga 


agccttgcta 


tggttctttt 


gccaagccta 


18960 


ccaataaaga 


aggaggccag 


gcaaatgtga 


aaaccgaaac 


aggcggtacc 


aaagaatatg 


19020 


acattgacat 


ggcattcttc 


gataatcgaa 


gtgcagctgc 


cgccggccta 


gccccagaaa 


19080 


ttgttttgta 


tactgagaat 


gtggatctgg 


aaactccaga 


tacccatatt 


gtatacaagg 


19140 


caggtacaga 


tgacagtagc 


tcttctatca 


atttgggtca 


gcagtccatg 


cccaacagac 


19200 


ccaactacat 


tggcttcaga 


gacaacttta 


tcggtctgat 


gtactacaac 


agcactggca 


19260 


atatgggtgt 


actggctgga 


caggcctccc 


agctgaatgc 


tgtggtggac 


ttgcaggaca 


19320 


gaaacaccga 


actgtcctac 


cagctcttgc 


ttgactctct 


gggtgacaga 


accaggtatt 


19380 


tcagtatgtg 


gaatcaggcg 


gtggacagtt 


atgaccccga 


tgtgcgcatt 


attgaaaatc 


19440 


acggtgtgga 


ggatgaactt 


cctaactatt 


gcttccccct 


ggatgctgtg 


ggtagaactg 


19500 


atacttacca 


gggaattaag 


gccaatggtg 


ataatcaaac 


cacctggacc 


aaagatgata 


19560 


ctgttaatga 


tgctaatgaa 


ttgggcaagg 


gcaatccttt 


cgccatggag 


atcaacatcc 


19620 


aggccaacct 


gtggcggaac 


ttcctctacg 


cgaacgtggc 


gctgtacctg 


cccgactcct 


19680 


acaagtacac 


gccggccaac 


atcacgctgc 


ccaccaacac 


caacacctac 


gattacatga 


19740 


acggccgcgt 


ggtggcgccc 


tcgctggtgg 


acgcctacat 


caacatcggg 


gcgcgctggt 


19800 


cgctggaccc 


catggacaac 


gtcaacccct 


tcaaccacca 


ccgcaacgcg 


ggcctgcgat 


19860 


accgctccat 


gctcctgggc 


aacgggcgct 


acgtgccctt 


ccacatccag 


gtgccccaaa 


19920 


agtttttcgc 


catcaagagc 


ctcctgctcc 


tgcccgggtc 


ctacacctac 


gagtggaact 


19980 


tccgcaagga 


cgtcaacatg 


atcctgcaga 


gctccctcgg 


caacgacctg 


cgcacggacg 


20040 


gggcctccat 


cgccttcacc 


agcatcaacc 


tctacgccac 


cttcttcccc 


atggcgcaca 


20100 


acaccgcctc 


cacgctcgag 


gccatgctgc 


gcaacgacac 


caacgaccag 


tccttcaacg 


20160 


actacctctc 


ggcggccaac 


atgctctacc 


ccatcccggc 


caacgccacc 


aacgtgccca 


20220 


tctccatccc 


ctcgcgcaac 


tgggccgcct 


tccgcggctg 


gtccttcacg 


cgcctcaaga 


20280 


cccgcgagac 


gccctcgctc 


ggctccgggt 


tcgaccccta 


cttcgtctac 


tcgggctcca 


20340 


tcccctacct 


cgacggcacc 


ttctacctca 


accacacctt 


caagaaggtc 


tccatcacct 


20400 


tcgactcctc 


cgtcagctgg 


cccggcaacg 


accgcctcct 


gacgcccaac 


gagttcgaaa 


20460 


tcaagcgcac 


cgtcgacgga 


gaggggtaca 


acgtggccca 


gtgcaacatg 


accaaggact 


20520 


ggttcctggt 


ccagatgctg 


gcccactaca 


acatcggcta 


ccagggcttc 


tacgtgcccg 


20580 


agggctacaa 


ggaccgcatg 


tactccttct 


tccgcaactt 


ccagcccatg 


agccgccagg 


20640 


tcgtggacga 


ggtcaactac 


aaggactacc 


aggccgtcac 


cctggcctac 


cagcacaaca 


20700 


a ctcgggctt 


cgtcggctac 


ctcgcgccca 


ccatgcgcca 


gggccagccc 


taccccgcca 


20760 
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actaccccta 


cccgctcatc 


qqcaaqaqcq 

33*— , "* v *3 , "*'3^"3 


ccgtcgccag 


cgtcacccag 


aaaaagttcc 


20820 


tctacaacca 


aatcatatqa 


cgcatcccct 


tctccagcaa 


cttcatgtcc 


atgggcgcgc 


20880 


tcaccgacct 


cqqccaqaac 


atgctctacg 


ccaactccgc 


ccacgcgcta 


gacatgaatt 


20940 


tcgaagtcga 


ccccatggat 


gagtccaccc 

3 3 


ttctctatgt 


tgtcttcgaa 


gtcttcgacg 


21000 


tcgtccgagt 


gcaccagccc 


caccgcggcg 


tcatcgaggc 


cgtctacctg 


cgcacgccct 


21060 


tctcaaccaa 


caacgccacc 


acctaagcct 


cttgcttctt 


gcaagatgac 


ggcctgcgcg 


21120 


qqctccqqcq 


agcaggagct 


cagggccatc 


ctccgcgacc 


tgggctgcgg 


gccctgcttc 


21180 


ctgggcacct 


tcgacaagcg 


cttcccggga 


ttcatggccc 


cgcacaagct 


ggcctgcgcc 


21240 


atcgtcaaca 


cqqccqqccq 

33 33 ^" 3 


cgagaccggg 


ggcgagcact 


ggctggcctt 


cgcctggaac 


21300 


ccgcgctccc 


acacctgcta 


cctcttcgac 


cccttcgggt 


tctcggacga 


gcgcctcaag 


21360 


cagatctacc 


aqttcqaqta 


cqaqqgcctg 

v *3 k *333 WN " 3 


ctgcgtcgca 


gcgccctggc 


caccgaggac 


21420 


cactQcatca 

> — 3 3 3 w 


ccctqqaaaa 


gtccacccag 


accgtgcagg 


gtccgcgctc 


ggccgcctgc 


21480 


qqqctcttct 


qctqcatqtt 


cctgcacgcc 


ttcgtgcact 


ggcccgaccg 

ZJ ZJ ZJ 3 


ccccatggac 


21540 


aaqaacccca 


ccatgaactt 


actqacqqqq 

3 W k *3 MV '3333 


gtgcccaacg 


gcatgctcca 


gtcgccccag 


21600 


qtqqaaccca 

W *A X* ^fl Xs \A 


ccctacaccq 


caaccaggag 


gcgctctacc 


gcttcctcaa 


cgcccactcc 


21660 


gcctactttc 


gctcccaccg 


cgcgcgcatc 


gagaaggcca 


ccgccttcga 


ccgcatgaat 


21720 


caagacatgt 


aatccqqtqt 


qtqtatgtga 

3 w 3 w *~ *- 3 ^ 3 *~ 


atgctttatt 


catcataata 


aacagcacat 


21780 


gtttatgcca 


ccttctctga 


ggctctgact 


ttatttagaa 


atcgaagggg 


ttctgccggc 


21840 


tctcggcatg 


gcccgcgggc 


agggatacgt 


tgcggaactg 


gtacttgggc 


agccacttga 


21900 


actcggggat 


cagcagcttc 


ggcacgggga 


ggtcggggaa 

33 ~ ~ ZJ Zj ZJ ZJ 


cgagtcgctc 


cacagcttgc 


21960 


qcatqaqttq 


cagggcgccc 


agcaggtcgg 


gcgcggagat 


cttgaaatcg 


cagttgggac 


22020 


ccgcgttctg 


cqcqcqaqaq 


ttacggtaca 


cggggttgca 


gcactggaac 


accatcaggg 


22080 


ccggqtqctt 

w w 3 3 3 *"3 ^" w 


cacgctcgcc 


agcaccgtcg 


cgtcggtgat 


gccctccacg 


tccagatcct 


22140 


cqqcqttqqc 

3 3 ^3 *- *-33 w 


catcccgaag 


ggggtcatct 


tgcaggtctg 


ccgccccatg 


ctgggcacgc 

ZJ ZJ ZJ ZJ 


22200 


aqccqqqctt 


qtqqttqcaa 

3 L 33 w 3 


tcqcagtgca 


gggggatcaq 

33333"*"* 3 


catcatctgg 


gcctgctcgg 

3 3 33 


22260 


agctcatgcc 


cgggtacatg 


gccttcatga 


aagcctccag 


ctggcggaag 

33 ^"33 3 


qcctgctgcg 

ZJ 3 3 3 


22320 


ccttgccgcc 


ctcggtgaag 


aagaccccgc 


aggacttgct 

w 33 3 


agagaactgg 


ttggtggcgc 

33 33 3 


22380 


agccagcgtc 


qtqcacqcaq 

25 ^ 27 ^^^^ ^ ^# ^^^^23 


cagcgcgcgt 


cgttgttggc 

3 »■ ^3 ~ "33 


cagctgcacc 


acgctgcgcc 


22440 


cccagcggtt 


ctqggtgatc 

^ z?3 3 w 3 


ttggcccggt 


cggggttctc 


cttcagcgcg 


cgctgcccgt 


22500 


tctcgctcgc 


cacatccatc 


tcgatcgtgt 


gctccttctg 


gatcatcacg 


gtcccgtgca 


22560 


ggcaccgcag 

ZJ ZJ 3 3 


cttgccctcg 


gcctcggtgc 


acccgtgcag 


ccacagcgcg 


cagccggtgc 


22620 


tctcccagtt 


cttqtqggcg 


atctgggagt 


gcgagtgcac 

3 3 3 3 


gaagccctgc 


aggaagcggc 


22680 


ccatcatcgt 


ggtcagggtc 


ttgt xgctgg 


rgaagg ucag 


cggaaxgccg 




£.£./ H\J 


cgttcacata 


caggtggcag 


atacggcggt 


acacctcgcc 


ctgctcgggc 


atcagctgga 


22800 


aggcggactt 


caggtcgctc 


tccacgcggt 


accggtccat 


cagcagcgtc 


atcacttcca 


22860 
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tgcccttctc ccaggccgaa acgatcggca ggctcagggg gttcttcacc gttgtcatct 22920 

tagtcgccgc cgccgaagtc agggggtcgt tctcgtccag ggtctcaaac actcgcttgc 22980 

cgtccttctc ggtgatgcgc acggggggaa agctgaagcc cacggccgcc agctcctcct 23040 

cggcctgcct ttcgtcctcg ctgtcctggc tgatgtcttg caaaggcaca tgcttggtct 23100 

tgcggggttt ctttttgggc ggcagaggcg gcggcggaga cgtgctgggc gagcgcgagt 23160 

tctcgctcac cacgactatt tcttctcctt ggccgtcgtc cgagaccacg cggcggtagg 23220 

catgcctctt ctggggcaga ggcggaggcg acgggctctc gcggttcggc gggcggctgg 23280 

cagagcccct tccgcgttcg ggggtgcgct cctggcggcg ctgctctgac tgacttcctc 23340 

cgcggccggc cattgtgttc tcctagggag caagcatgga gactcagcca tcgtcgccaa 23400 

catcgccatc tgcccccgcc gccgccgacg agaaccagca gcagcagaat gaaagcttaa 23460 

ccgccccgcc gcccagcccc acctccgacg ccgcagcccc agacatgcaa gagatggagg 23520 

aatccatcga gattgacctg ggctacgtga cgcccgcgga gcacgaggag gagctggcag 23580 

cgcgcttttc agccccggaa gagaaccacc aagagcagcc agagcaggaa gcagagagcg 23640 

agcagaacca ggctgggctc gagcatggcg actacctgag cggggcagag gacgtgctca 23700 

tcaagcatct ggcccgccaa tgcatcatcg tcaaggacgc gctgctcgac cgcgccgagg 23760 

tgcccctcag cgtggcggag ctcagccgcg cctacgagcg caacctcttc tcgccgcgcg 23820 

tgccccccaa gcgccagccc aacggcacct gcgagcccaa cccgcgcctc aacttctacc 23880 

cggtcttcgc ggtgcccgag gccctggcca cctaccacct ctttttcaag aaccaaagga 23940 

tccccgtctc ctgccgcgcc aaccgcaccc gcgccgacgc cctgctcaac ctgggccccg 24000 

gcgcccgcct acctgatatc gcctccttgg aagaggttcc caagatcttc gagggtctgg 24060 

gcagcgacga gactcgggcc gcgaacgctc tgcaaggaag cggagaggag catgagcacc 24120 

acagcgccct ggtggagttg gaaggcgaca acgcgcgcct ggcggtcctc aagcgcacgg 24180 

tcgagctgac ccacttcgcc tacccggcgc tcaacctgcc ccccaaggtc atgagcgccg 24240 

tcatggacca ggtgctcatc aagcgcgcct cgcccctctc ggaggaggag atgcaggacc 24300 

ccgagagctc ggacgagggc aagcccgtgg tcagcgacga gcagctggcg cgctggctgg 24360 

gagcgagtag caccccccag agcctggaag agcggcgcaa gctcatgatg gccgtggtcc 24420 

tggtgaccgt ggagctggag tgtctgcgcc gcttcttcgc cgacgcggag accctgcgca 24480 

aggtcgagga gaacctgcac tacctcttca gacacgggtt cgtgcgccag gcctgcaaga 24540 

tctccaacgt ggagctgacc aacctggtct cctacatggg catcctgcac gagaaccgcc 24600 

tggggcagaa cgtgctgcac accaccctgc gcggggaggc ccgccgcgac tacatccgcg 24660 

actgcgtcta cctgtacctc tgccacacct ggcagacggg catgggcgtg tggcagcagt 24720 

gcctggagga gcagaacctg aaagagctct gcaagctcct gcagaagaac ctcaaggccc 24780 

tgtggaccgg gttcgacgag cgcaccaccg ccgcggacct ggccgacctc atcttccccg 24840 

agcgcctgcg gctgacgctg cgcaacgggc tgcccgactt tatgagccaa agcatgttgc 24900 
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aaaactttcg ctctttcatc 


ctcgaacgct 


ccaoaatcct 


acccaccacc 


tgctccgcgc 


24960 


tqccctcqqa cttcqtqccq 

v 3** 3 3 3 3 3 


ctgaccttcc 


acaaataccc 

3 w y u zJ 


cccaccqctc 


tggagccact 


25020 


gctacctgct gcgcctggcc 


aactacctgg 


cctaccactc 


ggacgtgatc 


qaqqacqtca 


25080 


gcqgcgaggg cctgctcgag 


tgccactgcc 


gctgcaacct 


ctqcacqccq 


caccgctccc 


25140 


tggcctgcaa cccccagctg 


ctgagcgaga 

^33~ 33 3 3 


cccagatcat 


cggcaccttc 


qagttgcaaq 


25200 


gccccqgcqa qqgcaaqgqq 


ggtctgaaac 


tcaccccqqq 

w -w w 333 3 


qctqtqqacc 

3 *•* v 3 *-333*'**- w 


tcggcctact 


25260 


tgcgcaagtt cgtgcccgag 


gactaccatc 


ccttcgagat 


caggttctac 


qaaaaccaat 


25320 


cccagccgcc caaggccgag 


ctgtcggcct 


qcqtcatcac 


ccaqqqqqcc 

wv,u 3 3 3 33 3*"** 


atcctaaccc 


25380 


aattgcaagc catccagaaa 


tcccgccaag 


aatttctgct 


gaaaaagggc 


cacggggtct 


25440 


acttggaccc ccagaccgga 


gaggagctca 


accccagctt 


cccccaggat 


qccccqaqqa 


25500 


agcagcaaga agctgaaagt 


ggagctgccg 


ccgccgccgg 


aqqatttqqa 


qqaaaactaa 


25560 


gagagcaqtc aqqcaqaqqa 


qqaqqaqatq 

D3J«*v)3J«*3»* *"3 


qaaqactqaa 


acaacactca 


aacaaaaaaa 


25620 


gacagcctgc aagacagtct 


qqaqqaqqaa 


qacqaqqtqq 


aqqaaqcaaa 

** *3 33 ** j3 33 *" y ** 


aaaaaaaaca 


25680 


gccgccgcca gaccgtcgtc 


ctcqqcqqaq 


qaqqaqaaaa 

33** 33 3 ****w*%j 


caagcagcac 


aaataccatc 


25740 


tccgctccqq qtcqqqqtcq 

3 3 3 3 3333 w ^" 3 


cqqcqqccaa 


gcccacagta 


aataaaacaa 

33** *-g33 33***- , y**' 




25800 


ttcccgaacc ccaccaccca 


qaccqqtaaq 


aaqqaqcqac 


aaaaatacaa 




25860 


gggcacaaaa acgccatcgt 


ctcctgcttg 


caagcctgcg 


qqqqcaacat 


ctccttcacc 


25920 


cggcgctacc tgctcttcca 


ccgcggqgtg 

***3**33S1SI w 3 


aacttccccc 


gcaacatctt 


gcattactac 


25980 


cgtcacctcc acagccccta 


ctactgtttc 


caagaagagg 


cagaaaccca 


acaacaacaa 

33 *— **33 w% *33 **-**23 


26040 


cagcagcaga aaaccagcgg 


cagcagctag 


aaaatccaca 


qcqqcqqcaq 


qtaqactaaa 

33 *-3 3""*- L y w 3 


26100 


gatcgcqqcq aacqaqccqq 

Z? 3 3 3 3 ^- 23 3 33 


cgcagacccg 


qqaqctaaaa 

3 3**3*- L 3 U 3 3 


aaccaaatct 


"ttcccaccct 


26160 


ctatgccatc ttccagcaga 


qtcqqqqqca 

33 33333 wv * 


agagcaggaa 


ctqaaaatca 


aaaaccattic 


26220 

Cm \Jtm fm \J 


tctgcgctcg ctcacccgca 


gttqtctqta 


tcacaagagc 


qaaqaccaac 


ttcaacacac 


26280 


tctcgaggac gccgaggctc 


tcttcaacaa 


gtactgcgcg 


ctcactctta 


aagagtagcc 


26340 


cgcgcccgcc cacacacgga 


aaaaggcqgg 

3 3*"333 


aattacgtca 


ccacctgcgc 


ccttcaccca 


26400 


accatcatca tgagcaaaga 


gattcccacg 


ccttacatgt 


aaaactacca 

3333**23 %J " WMVv*a«lA 


accccaaata 

33 w *w ** \H c* «• 23 


26460 


ggcctqqccq ccqqcqccqc 

33 ** 23 3 3 33 3 3 


ccaggactac 


tccacccgca 


taaactaact 


i -33*-'*-3333 


26520 


cccgcgatga tctcacgggt 


gaatgacatc 


cgcgcccacc 


aaaaccaaat 


actcctaaaa 

%*%» *V>V l» %A *H **U. 


26580 

C \J J \J \J 


cagtcagcga tcaccgccac 


gccccgccat 


caccttaatc 


cqcataatta 


acccaccacc 

33 *" *~3 v>w 


26640 


ctggtgtacc aggaaattcc 


ccagcccacg 


accgtactac 


ttccacaaaa 


CQCCCcLQQCC 

*— *J V* V. *•»** W *■ 


26700 


gaagtccagc tgactaactc 


aqqtqtccaq 


ctqqccqqca 


qcaccaccct 

23 33 3 **» v» *» 


atatcatrcac 


26760 

C\J 1 \J\J 


cgccccgctc agggtataaa 


qcggctgqtg 

3 33^" **33 w 33 


atccqaaaca 

•* *» *»33 3 3 


aaaacacaca 

*J ** r3 Tj Vi4 W%4 


actcaacaac 


26820 


gagguggLga gc lc l ucgc l 


gggtctgcga 


cctgacggag 


tcttccaact 


cgccggatcg 


26880 


gggagatctt ccttcacgcc 


tcgtcaggcc 


gtcctgactt 


tggagagttc 


gtcctcgcag 


26940 


ccccgctcgg gtggcatcgg 


cactctccag 


ttcgtggagg 


agttcactcc 


ctcggtctac 


27000 
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ttcaacccct 


tctccggctc 


ccccggccac 


tacccggacg 


agttcatccc 


gaacttcgac 


27060 


gccatcagcg 


agtcggtgga 


cggctacgat 


tgaatgtccc 


atggtggcgc 


ggctgaccta 


27120 


gctcggcttc 


gacacctgga 


ccactgccgc 


cgcttccgct 


gcttcgctcg 


ggatctcgcc 


27180 


gagtttgcct 


actttgagct 


gcccgaggag 


caccctcagg 


gcccggccca 


cggagtgcgg 


27240 


atcgtcgtcg 


aagggggtct 


cgactcccac 


ctgcttcgga 


tcttcagcca 


gcgtccgatc 


27300 


ctggccgagc 


gcgagcaagg 


acagaccctt 


ctgaccctgt 


actgcatctg 


caaccacccc 


27360 


ggcctgcatg 


aaagtctttg 


ttgtctgctg 


tgtactgagt 


ataataaaag 


ctgagatcag 


27420 


cgactactcc 


ggacttccgt 


gtgttcctgc 


tatcaaccag 


tccctgttct 


tcaccgggaa 


27480 


cgagaccgag 


ctccagctcc 


agtgtaagcc 


ccacaagaag 


tacctcacct 


ggctgttcca 


27540 


gggctctccg 


atcgccgttg 


tcaaccactg 


cgacaacgac 


ggagtcctgc 


tgagcggccc 


27600 


tgccaacctt 


actttttcca 


cccgcagaag 


caagctccag 


ctcttccaac 


ccttcctccc 


27660 


cgggacctat 


cagtgcgtct 


cgggaccctg 


ccatcacacc 


ttccacctga 


tcccgaatac 


27720 


cacagcgtcg 


ctccccgcta 


ctaacaacca 


aactacccac 


caacgccacc 


gtcgcgacct 


27780 


ttcctctggg 


tctaatacca 


ctaccggagg 


tgagctccga 


ggtcgaccaa 


cctctgggat 


27840 


ttactacggc 


ccctgggagg 


tggtagggtt 


aatagcgcta 


ggcctagttg 


cgggtgggct 


27900 


tttggctctc 


tgctacctat 


acctcccttg 


ctgttcgtac 


ttagtggtgc 


tgtgttgctg 


27960 


gtttaagaaa 


tggggaagat 


caccctagtg 


agctgcggtg 


tgctggtggc 


ggtggtgctt 


28020 


tcgattgtgg 


gactgggcgg 


cgcggctgta 


gtgaaggaga 


aggccgatcc 


ctgcttgcat 


28080 


ttcaatcccg 


acaaatgcca 


gctgagtttt 


cagcccgatg 


gcaatcggtg 


cgcggtgctg 


28140 


atcaagtgcg 


gatgggaatg 


cgagaacgtg 


agaatcgagt 


acaataacaa 


gactcggaac 


28200 


aatactctcg 


cgtccgtgtg 


gcagcccggg 


gaccccgagt 


ggtacaccgt 


ctctgtcccc 


28260 


ggtgctgacg 


gctccccgcg 


caccgtgaat 


aatactttca 


tttttgcgca 


catgtgcgac 


28320 


acggtcatgt 


ggatgagcaa 


gcagtacgat 


atgtggcccc 


ccacgaagga 


gaacatcgtg 


28380 


gtcttctcca 


tcgcttacag 


cgtgtgcacg 


gcgctaatca 


ccgctatcgt 


gtgcctgagc 


28440 


attcacatgc 


tcatcgctat 


tcgccccaga 


aataatgccg 


aaaaagaaaa 


acagccataa 


28500 


cacgtttttt 


cacacacctt 


tttcagacca 


tggcctctgt 


taaatttttg 


cttttatttg 


28560 


ccagtctcat 


tgccgtcatt 


catggaatga 


gtaatgagaa 


aattactatt 


tacactggca 


28620 


ctaatcacac 


attgaaaggt 


ccagaaaaag 


ccacagaagt 


ttcatggtat 


tgttatttta 


28680 


atgaatcaga 


tgtatctact 


gaactctgtg 


gaaacaataa 


caaaaaaaat 


gagagcatta 


28740 


ctctcatcaa 


gtttcaatgt 


ggatctgact 


taaccctaat 


taacatcact 


agagactatg 


28800 


taggtatgta 


ttatggaact 


acagcaggca 


tttcggacat 


ggaattttat 


caagtttctg 


28860 


tgtctgaacc 


caccacgcct 


agaatgacca 


caaccacaaa 


aactacacct 


gttaccacta 


28920 


tacagctcac 


taccaatggc 


tttcttgcca 


tgcttcaagt 


ggctgaaaat 


agcaccagca 


28980 


ttcaacccac 


cccacccagt 


gaggaaattc 


ccagatccat 


gattggcatt 


attgttgctg 


29040 
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taataatata 


catgttgatc 


atcgccttgt 


gcatqqtgta 


ctatgccttc 


tgctacagaa 


29100 


agcacagact 


gaacgacaag 


ctggaacact 


tactaagtgt 


tgaattttaa 


ttttttagaa 


29160 


ccatgaagat 


cctaggcctt 


ttagtttttt 


ctatcattac 


ctctgctcta 


tgcaattctg 


29220 


acaatgagga 


cgttactgtc 


qttqtcqqat 


caaattatac 


actaaaaggt 


ccagcaaaag 

7 7 


29280 


gtatgctttc 


gtggtattgt 


tggttcggaa 


ctgacgagca 


acagacagaa 


ctttgcaatg 


29340 


ctcaaaaagg 


caaaacctca 


aattctaaaa 


tctctaatta 


tcaatgcaat 


ggcactgact 


29400 


taatattact 


caatqtcacq 


aaagcatatg 


ctqqcaqtta 


cacctgccct 


ggagatgatg 


29460 


ccgacaatat 


gattttttac 


aaagtggaag 


tggttgatcc 


cactactcca 


ccgcccacca 


29520 


ccacaactac 


tcataccaca 


cacacagaac 


aaacaccaga 


ggcagcagaa 


gcagagttgg 

7 7 7 77 


29580 


ccttccaqqt 


tcacggagat 


tcctttgctg 


tcaatacccc 


tacacccgat 


cagcggtgtc 

7 77 7 


29640 


cggqqctqct 


cgtcagcgqc 

3 3 3 3 


attgtcggtg 


tgctttcggg 


attagcagtc 


ataatcatct 


29700 


gcatgttcat 


ttttgcttgc 


tgctatagaa 


ggctttaccg 


acaaaaatca 


gacccactgc 


29760 


tgaacctcta 


tgtttaattt 


tttccagagc 


catgaaggca 


gttagcgctc 


tagttttttg 


29820 


ttctttgatt 


ggcattgttt 


ttagtgctgg 

3 3 3 3 


gtttttgaaa 


aatcttacca 


tttatgaagg 


29880 


tgagaatgcc 


actctagtgg 


gcatcagtgg 

^3 3 13 13 


tcaaaatgtc 


agctggctaa 


aataccatct 


29940 


aqatqqqtqq 

-3 -3 -J -? w 3 3 


aaagacattt 


qcqattqqaa 

-3 -3 *- '"JJ w 


tgtcactgtg 


tatacatgta 


atggagttaa 


30000 


cctcaccatt 


actaatgcca 


cccaagatca 


gaatggtagg 


tttaagggcc 


agagtttcac 


30060 


tagaaataat 


qggtatgaat 

3 3 3 -3 


cccataacat 


gtttatctat 


gacgtcactg 


tcatcagaaa 


30120 


tgagactgcc 


accaccacac 


agatgcccac 


tacacacagt 


tctaccacta 


ctaccatgca 


30180 


aaccacacag 


acaaccacta 


catcaactca 


gcatatgacc 


accactacag 


cagcaaagcc 


30240 


aagtagtgca 


gcqcctcaqc 


cccaggcttt 


ggctttgaaa 


gctgcacaac 


ctagtacaac 


30300 


tactaggacc 


aatgagcaga 


ctactgaatt 


tttgtccact 


gtcgagagcc 


acaccacagc 


30360 


tacctccagt 


gccttctcta 


gcaccgccaa 


tctctcctcg 


ctttcctcta 


caccaatcag 


30420 


tcccgctact 


actcccaccc 


cagctcttct 


ccccactccc 


ctgaagcaaa 


ctgaggacag 


30480 


cggcatgcaa 


tggcagatca 


ccctgctcat 


tgtgatcggg 

3 ~ 3 937 


ttggtcatcc 


tggccgtgtt 


30540 


gctctactac 


atcttctgcc 


gccgcattcc 


caacgcgcac 


cgcaaaccgg 


cctacaagcc 


30600 


catcgttatc 


qggcaqccgg 


agccgcttca 


qgtggaaggg 

73 »-77*-*— 777 


ggtctaagga 


atcttctctt 


30660 


ctcttttaca 


gtatggtgat 


tgaactatga 


ttcctagaca 


attcttgatc 


actattctta 


30720 


tctgcctcct 


ccaagtctgt 


gccaccctcg 


ctctgqtqgc 

fc 3 J 3 3 


caacgccagt 


ccagactgta 


30780 


ttgggccctt 


cgcctcctac 


gtgctctttg 


ccttcatcac 


ctgcatctgc 


tgctgtagca 


30840 


tagtctgcct 


gcttatcacc 


ttcttccagt 


tcattgactg 


gatctttgtg 


cgcatcgcct 


30900 


acctgcgcca 


ccacccccag 


taccgcgacc 


aqcqaqtqqc 


gcggctgctc 


aqqctcctct 


30960 


gataagcatg 


cgggctctgc 


tacttctcgc 


gcttctgctg 


ttagtgctcc 


cccgccccgt 




cgacccccgg 


tcccccactc 


agtcccccga 


agaggtccgc 


aaatgcaaat 


tccaagaacc 


31080 


ctggaaattc 


ctcaaatgct 


accgccaaaa 


atcagacatg 


cttcccagct 


ggatcatgat 


31140 
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cattgggatc 


gtgaacattc 


tggcctgcac 


cctcatctcc 


tttgtgattt 


acccctgctt 


31200 


tgactttggt 


tggaactcgc 


cagaggcgct 


ctatctcccg 


cctgaacctg 


acacaccacc 


31260 


acagcaacct 


caggcacacg 


cactaccacc 


accacagcct 


aggccacaat 


acatgcccat 


31320 


attagactat 


gaggccgagc 


cacagcgacc 


catgctcccc 


gctattagtt 


acttcaatct 


31380 


aaccggcgga 


gatgactgac 


ccactggcca 


acaacaacgt 


caacgacctt 


ctcctggaca 


31440 


tggacggccg 


cgcctcggag 


cagcgactcg 


cccaacttcg 


cattcgccag 


cagcaggaga 


31500 


gagccgtcaa 


ggagctgcag 


gacggcatag 


ccatccacca 


gtgcaagaaa 


ggcatcttct 


31560 


gcctggtgaa 


acaggccaag 


atctcctacg 


aggtcacccc 


gaccgaccat 


cgcctctcct 


31620 


acgagctcct 


gcagcagcgc 


cagaagttca 


cctgcctggt 


cggagtcaac 


cccatcgtca 


31680 


tcacccagca 


gtcgggcgat 


accaaggggt 


gcatccactg 


ctcctgcgac 


tcccccgact 


31740 


gcgtccacac 


tctgatcaag 


accctctgcg 


gcctccgcga 


cctcctcccc 


atgaactaat 


31800 


caccccctta 


tccagtgaaa 


taaatatcat 


attgatgatg 


atttaaataa 


aaaataatca 


31860 


tttgatttga 


aataaagata 


caatcatatt 


gatgatttga 


gttttaaaaa 


ataaagaatc 


31920 


acttacttga 


aatctgatac 


caggtctctg 


tccatgtttt 


ctgccaacac 


cacctcactc 


31980 


ccctcttccc 


agctctggta 


ctgcagaccc 


cggcgggctg 


caaacttcct 


ccacacgctg 


32040 


aaggggatgt 


caaattcctc 


ctgtccctca 


atcttcattt 


tatcttctat 


cagatgtcca 


32100 


aaaagcgcgt 


ccgggtggat 


gatgacttcg 


accccgtcta 


cccctacgat 


gcagacaacg 


32160 


caccgaccgt 


gcccttcatc 


aaccccccct 


tcgtctcttc 


agatggattc 


caagagaagc 


32220 


ccctgggggt 


gctgtccctg 


cgactggctg 


accccgtcac 


caccaagaac 


ggggaaatca 


32280 


ccctcaagct 


gggagagggg 


gtggacctcg 


actcctcggg 


aaaactcatc 


tccaacacgg 


32340 


ccaccaaggc 


cgccgcccct 


ctcagttttt 


ccaacaacac 


catttccctt 


aacatggata 


32400 


cccctcttta 


taccaaagat 


ggaaaattat 


ccttacaagt 


ttctccaccg 


ttaaacatat 


32460 


taaaatcaac 


cattctgaac 


acattagctg 


tagcttatgg 


atcaggttta 


ggactgagtg 


32520 


gtggcactgc 


tcttgcagta 


cagttggcct 


ctccactcac 


ttttgatgaa 


aaaggaaata 


32580 


ttaaaattaa 


cctagccagt 


ggtccattaa 


cagttgatgc 


aagtcgactt 


agtatcaact 


32640 


gcaaaagagg 


ggtcactgtc 


actacctcag 


gagatgcaat 


tgaaagcaac 


ataagctggc 


32700 


ctaaaggtat 


aagatttgaa 


ggtaatggca 


tagctgcaaa 


cattggcaga 


ggattggaat 


32760 


ttggaaccac 


tagtacagag 


actgatgtca 


cagatgcata 


cccaattcaa 


gttaaattgg 


32820 


gtactggcct 


tacctttgac 


agtacaggcg 


ccattgttgc 


ttggaacaaa 


gaggatgata 


32880 


aacttacatt 


atggaccaca 


gccgacccct 


cgccaaattg 


caaaatatac 


tctgaaaaag 


32940 


atgccaaact 


cacactttgc 


ttgacaaagt 


gtggaagtca 


aattctgggt 


actgtgactg 


33000 


tattggcagt 


gaataatgga 


agtctcaacc 


caatcacaaa 


cacagtaagc 


actgcactcg 


33060 


tctccctcaa 


gtttgatgca 


agtggagttt 


tgctaagcag 


ctccacatta 


gacaaagaat 


33120 


attggaactt 


cagaaaggga 


gatgttacac 


ctgctgagcc 


ctatactaat 


gctataggtt 


33180 
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ttatgcctaa 


cataaaggcc 


tatcctaaaa 


acacatctgc 


agcttcaaaa 


agccatattg 


33240 


tcagtcaagt 


ttatctcaat 


gggqatgaqg 

z) ZJ Z/Zj 3 3 3 


ccaaaccact 


gatgctgatt 


attactttta 


33300 


atgaaactga 


ggatgcaact 


tgcacctaca 


gtatcacttt 


tcaatggaaa 


tgggatagta 


33360 


ctaagtacac 


aggtgaaaca 

Zf ZJ 3 


cttgctacca 


gctccttcac 


cttctcctac 


atcgcccaag 


33420 


aatgaacact 


gtatcccacc 


ctgcatgcca 


acccttccca 


ccccactctg 


tctatggaaa 


33480 


aaactctgaa 


gcacaaaata 


aaataaagtt 


caagtgtttt 


attgattcaa 


cagttttaca 


33540 


ggattcgagc 

33*** ** ^3 3 


agttattttt 


cctccaccct 


cccaggacat 


ggaatacacc 

3 3 


accctctccc 


33600 


cccgcacagc 


cttgaacatc 


tgaatgccat 


tggtgatgga 


catgcttttg 


gtctccacgt 


33660 


tccacacagt 


ttcagagcga 


gccagtctcg 


ggtcggtcag 

3 3 3 3 3 


ggagatgaaa 


ccctccgggc 


33720 


actcccgcat 


ctgcacctca 


cagctcaaca 


gctgaggatt 


gtcctcggtg 

3 33 3 


gtcgggatca 


33780 


cggttatctg 


gaagaagcag 


aagagcggcg 


gtgggaatca 


tagtccgcga 


acgggatcgg 

333 77 


33840 


ccgqtggtgt 

33 33 3 


cgcatcaggc 


cccgcagcag 


tcgctgccgc 


cgccgctccg 


tcaagctgct 


33900 


gctcaqqqqq 

-3 33 333 


tccgggtcca 


gggactccct 


cagcatgatg 


cccacggccc 


tcagcatcag 


33960 


tcgtctggtg 


cgqcgggcgc 


agcagcgcat 


gcggatctcg 

3 3 3 3 


ctcaggtcgc 


tgcagtacgt 


34020 


gcaacacagg 


accaccaggt 


tgttcaacag 


tccatagttc 


aacacgctcc 


agccgaaact 


34080 


catcgcggga 


aggatgctac 


ccacgtggcc 


gtcgtaccag 


atcctcaggt 


aaatcaagtg 


34140 


gcgctccctc 


cagaacacgc 

3 3 


tgcccacgta 


catgatctcc 


ttgggcatgt 


ggcggttcac 


34200 


cacctcccgg 


taccacatca 


ccctctggtt 


gaacatgcag 


ccccggatga 


tcctgcggaa 


34260 


ccacagggcc 


agcaccgccc 


cgcccgccat 


gcagcgaaga 


gaccccgggt 


cccggcaatg 


34320 


gcaatggagg 


acccaccgct 


cgtacccgtg 


gatcatctgg 


gagctgaaca 


agtctatgtt 


34380 


ggcacagcac 


aggcatatgc 


tcatgcatct 


cttcagcact 


ctcagctcct 


cgggggtcaa 

7 7 7 7 7 


34440 


aaccatatcc 


cagggcacgg 


ggaactcttg 


caggacagcg 


aaccccgcag 


aacagggcaa 


34500 


tcctcgcaca 


taacttacat 


tgtgcatgga 


cagggtatcg 


caatcaggca 


gcaccgggtg 

3 333 3 


34560 


atcctccacc 


agagaagcgc 


gggtctcggt 

333 w *^33 


ctcctcacag 


cgtggtaagg 

3 33 3 3 


gggccggccg 

777 33 3 


34620 


atacgggtga 


tggcgggacg 

33 7 7 7 7 


cggctgatcg 


tgttcgcgac 


cgtgtcatga 


tgcagttgct 


34680 


ttcggacatt 


ttcgtacttg 

3 3 


ctgtagcaga 


acctggtccg 


ggcgctgcac 


accgatcgcc 


34740 


ggcggcggtc 

3333 33 


ccggcgcttg 

ZJ ZJ ZJ ZJ 


gaacgctcgg 


tgttgaaatt 


gtaaaacagc 


cactctctca 


34800 


gaccgtgcag 


cagatctagg 


gcctcaggag 


tgatgaagat 


cccatcatgc 


ctgatagctc 


34860 


tgatcacatc 


gaccaccgtg 


gaatgggcca 


gacccagcca 


gatgatgcaa 


ttttgttggg 


34920 


tttcggtgac 


ggcqggggag 

33 33973 3 


ggaagaacag 


gaagaaccat 


gattaacttt 


taatccaaac 


34980 


ggtctcggag 


cacttcaaaa 


tgaaggtcgc 


ggagatggca 

33 3 3 3 


cctctcgccc 


ccgctgtgtt 


35040 


ggtggaaaat 

Mm* Mm* mj mm* 


aacagccagg 


tcaaaggtga 


tacggttctc 


gagatgttcc 


acggtqqctt 

3 3 *"3 3 


35100 


^r*3/*i^^ "3 

cudycaaagc 


c xccacgcgc 


acaxccagaa 


acaagacaar 


agcgaaagcg 


ggagggttct 




ctaattcctc 


aatcatcatg 


ttacactcct 


gcaccatccc 


cagataattt 


tcatttttcc 


35220 


agccttgaat 


gattcgaact 


agttcctgag 


gtaaatccaa 


gccagccatg 


ataaagagct 


35280 
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cgcgcagagc 


gccctccacc 


ggcattctta 


agcacaccct 


cataattcca 


agatattctg 


35340 


ctcctggttc 


acctgcagca 


gattgacaag 


cggaatatca 


aaatctctgc 


cgcgatccct 


35400 


aagctcctcc 


ctcagcaata 


actgtaagta 


ctctttcata 


tcctctccga 


aatttttagc 


35460 


cataggacca 


ccaggaataa 


gattagggca 


agccacagta 


cagataaacc 


gaagtcctcc 


35520 


ccagtgagca 


ttgccaaatg 


caagactgct 


ataagcatgc 


tggctagacc 


cggtgatatc 


35580 


ttccagataa 


ctggacagaa 


aatcacccag 


gcaattttta 


agaaaatcaa 


caaaagaaaa 


35640 


atcctccagg 


tgcacgttta 


gagcctcggg 


aacaacgatg 


aagtaaatgc 


aagcggtgcg 


35700 


ttccagcatg 


gttagttagc 


tgatctgtaa 


aaaacaaaaa 


ataaaacatt 


aaaccatgct 


35760 


agcctggcga 


acaggtgggt 


aaatcgttct 


ctccagcacc 


aggcaggcca 


cggggtctcc 


35820 


ggcgcgaccc 


tcgtaaaaat 


tgtcgctatg 


attgaaaacc 


atcacagaga 


gacgttcccg 


35880 


gtggccggcg 


tgaatgattc 


gacaagatga 


atacaccccc 


ggaacattgg 


cgtccgcgag 


35940 


tgaaaaaaag 


cgcccgagga 


agcaataagg 


cactacaatg 


ctcagtctca 


agtccagcaa 


36000 


agcgatgcca 


tgcggatgaa 


gcacaaaatc 


ctcaggtgcg 


tacaaaatgt 


aattactccc 


36060 


ctcctgcaca 


ggcagcgaag 


cccccgatcc 


ctccagatac 


acatacaaag 


cctcagcgtc 


36120 


catagcttac 


cgagcagcag 


cacacaacag 


gcgcaagagt 


cagagaaagg 


ctgagctcta 


36180 


acctgtccac 


ccgctctctg 


ctcaatatat 


agcccagatc 


tacactgacg 


taaaggccaa 


36240 


agtctaaaaa 


tacccgccaa 


ataatcacac 


acgcccagca 


cacgcccaga 


aaccggtgac 


36300 


acactcaaaa 


aaatacgcgc 


acttcctcaa 


acgcccaaac 


tgccgtcatt 


tccgggttcc 


36360 


cacgctacgt 


catcggaatt 


cgactttcaa 


attccgtcga 


ccgttaaaaa 


cgtcacccgc 


OCA 


cccgccccta 


acggtcgccc 


gtctctcggc 


caatcacctt 


cctccctccc 


caaattcaaa 


36480 


cagctcattt 


gcatattaac 


gcgcaccaaa 


agtttgaggt 


atattattga 


tgatg 


36535 


<210> 4 
<211> 34264 
<212> DNA 

<213> simian adenovirus SV-1 










<400> 4 

tccttattct ggaaacgtgc caatatgata 


atgagcgggg 


aggagcgagg 


cggggccggg 


60 


gtgacgtgcg gtgacgtggg 


gtgacgcggg 


gtggcgcgag 


ggcggggcgg 


gagtggggag 


120 


gcgcttagtt tttacgtatg 


cggaaggagg 


ttttataccg 


gaagttgggt 


aatttgggcg 


180 


tatacttgta agttttgtgt aatttggcgc 


gaaaaccggg 


taatgaggaa 


gttgaggtta 


240 


atatgtactt tttatgactg ggcggaattt 


ctgctgatca 


gcagtgaact 


ttgggcgctg 


300 


acggggaggt ttcgctacgt ggcagtacca 


cgagaaggct 


caaaggtccc 


atttattgta 


360 


ctcctcagcg ttttcgctgg gtatttaaac 


gctgtcagat 


catcaagagg 


ccactcttga 


420 


gtgccggcga gtagagtttt ctcctccgcg 


ctgccgcgat 


gaggctggtt 


cccgagatgt 


How 


acggtgtttt ctgcagcgag acggcccgga 


actcagatga 


gctgcttaat 


acagatctgc 


540 


tggatgttcc caactcgcct gtggcttcgc 


ctccgtcgct tcatgatctt 
Page 54 


ttcgatgtgg 
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aagtggatcc 


accgcaagat 


cccaacgagg 


acgcggtaaa 


cagtatgttc 


cctgaatgtc 


660 


tgtttgaggc 


ggctgaggag 


ggttctcaca 


gcagtgaaga 


gagcagacgg 


ggagaggaac 


720 


tggacttgaa 


atgctacgag 


gaatgtctgc 


cttctagcga 


ttctgaaacg 


gaacagacag 


780 


ggggagacgg 


ctgtgagtcg 


gcaatgaaaa 


atgaacttgt 


attagactgt 


ccagaacatc 


840 


ctggtcatgg 


ctgccgtgcc 


tgtgcttttc 


atagaaatgc 


cagcggaaat 


cctgagactc 


900 


tatgtgctct 


gtgttatctg 


cgccttacca 


gcgattttgt 


atacagtaag 


taaagtgttt 


960 


tcattggcgt 


acggtagggg 


attcgttgaa 


gtgctttgtg 


acttattatg 


tgtcattatt 


1020 


tctaggtgac 


gtgtccgacg 


tggaagggga 


aggagataga 


tcaggggctg 


ctaattctcc 


1080 


ttgcactttg 


ggggctgtgg 


ttccagttgg 


catttttaaa 


ccgagtggtg 


gaggagaacg 


1140 


agccggagga 


gaccgagaat 


ctgagagccg 


gcctggaccc 


tccagtggaa 


gactaggtgc 


1200 


tgaggatgat 


cctgaagagg 


ggactagtgg 


gggtgctagg 


aaaaagcaaa 


aaactgagcc 


1260 


tgaacctaga 


aactttttga 


atgagttgac 


tgtaagccta 


atgaatcggc 


agcgtcctga 


1320 


gacggtgttt 


tggactgagt 


tggaggatga 


gttcaagaag 


ggggaattaa 


acctcttgta 


1380 


caagtatggg 


tttgagcagt 


tgaaaactca 


ctggttggag 


ccgtgggagg 


atatggaaat 


1440 


ggctctagac 


acctttgcta 


aagtggctct 


gcggccggat 


aaagtttaca 


ctattcgccg 


1500 


cactgttaat 


ataaaaaaga 


gtgtttatgt 


tatcggccat 


ggagctctgg 


tgcaggtgca 


1560 


gaccccagac 


cgggtggctt 


tcaattgcgg 


catgcagagt 


ttgggccccg 


gggtgatagg 


1620 


tttgaatgga 


gttacatttc 


aaaatgtcag 


gtttactggt 


gatgatttta 


atggctctgt 


1680 


gtttgtgact 


agcacccagc 


taaccctcca 


cggtgtttac 


ttttttaact 


ttaacaatac 


1740 


atgtgtggag 


tcatggggta 


gggtgtctct 


gaggggctgc 


agttttcatg 


gttgctggaa 


1800 


ggcggtggtg 


ggaagaatta 


aaagtgtcat 


gtctgtgaag 


aaatgcatat 


ttgaacgctg 


1860 


tgtgatagct 


ctagcagtag 


aggggtacgg 


acggatcagg 


aataacgccg 


catctgagaa 


1920 


tggatgtttt 


cttttgctga 


aaggtacggc 


cagcgttaag 


cataatatga 


tttgcggcag 


1980 


cggcctgtgc 


ccctcgcagc 


tcttaacttg 


cgcagatgga 


aactgtcaca 


ccttgcgcac 


2040 


cgtgcacata 


gtgtcccact 


cgcgccgcac 


ctggccaaca 


tttgagcaca 


atatgctcat 


2100 


gcgttgcgcc 


gttcacctag 


gtgctagacg 


cggcgtgttt 


atgccttatc 


aatgtaactt 


2160 


tagtcatact 


aagattttgc 


tggaaactga 


ttccttccct 


cgagtatgtt 


tcaatggggt 


2220 


gtttgacatg 


tcaatggaac 


tttttaaagt 


gataagatat 


gatgaaacca 


agtctcgttg 


2280 


tcgctcatgt 


gaatgcggag 


ctaatcattt 


gaggttgtat 


cctgtaaccc 


tgaacgttac 


2340 


cgaggagctg 


aggacggacc 


accacatgct 


gtcttgcctg 


cgtaccgact 


atgaatccag 


2400 


cgatgaggag 


tgaggtgagg 


ggcggagcca 


caaagggtat 


aaaggggcat 


gaggggtggg 


2460 


cgcggtgttt 


caaaatgagc 


gggacgacgg 


acggcaatgc 


gtttgagggg 


ggagtgttca 


2520 


gcccatatct 


gacatctcgt 


cttccttcct 


gggcaggagt 


tcgtcagaat 


gtagtgggct 


2580 


ccaccgtgga 


cggacggccg 


gtcgcccctg 


caaattccgc 


caccctcacc 


tatgccaccg 


2640 
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taqqatcatc 


gttggacact 


qccqcqqcaq 


ctgccgcttc 


tgctgccgct 


tctactgctc 


2700 


acaacataac 


ggctgatttt 


ggactatata 


accaactggc 


cactgcagct 


gtggcgtctc 

3 33 3 


2760 


aatctctaat 


tcaagaagat 


qccctqaatg 


tgatcttgac 


tcgcctqqag 

3 33 3 


atcatgtcac 


2820 


atcacctqqa 

3 3 3 


cgaactggct 


gcgcagatat 


cccaagctaa 


ccccgatacc 


gcttcagaat 


2880 


cttaaaataa 


agacaaacaa 


atttgttgaa 


aagtaaaatg 


gctttatttg 


ttttttttgg 


2940 


ctcqqtaqqc 


tcgggtccac 


ctgtctcggt 


cgttaaggac 


tttgtgtatg 


ttttccaaaa 


3000 


cacggtacag 


atqqqcttqq 

333 w 3 3 


atgttcaagt 


acatgggcat 


gaggccatct 


ttggggtgga 

» ^3333 3 3 


3060 


gataggacca 


ctqaaqaqcq 

3 3 3 3 


tcatgttccg 


gggtggtatt 

333 33**** Wfc 


gtaaatcacc 


cagtcgtagc 


3120 


agggtttttg 


agcgtggaac 


tggaatatgt 


ccttcaggag 


caggctaatg 


gccaagggta 


3180 


gacccttagt 


gtaggtgttt 


acaaagcggt 


tgagctggga 


gggatgcatg 


cqqqqqqaqa 


3240 


tgatatgcat 


cttggcttgg 


attttgaggt 


tagctatgtt 


accacccagg 


tctctgcqgg 

3 ^*333 


3300 


qqttcatqtt 


atgaaggacc 


accagcacgg 


tatagccagt 


qcatttgggg 


aacttgtcat 


3360 


qcaqtttqqa 


qqqqaaqqcq 

3333 UM 33 v -3 


tggaagaatt 


tagatacccc 


cttgtgcccc 


cctaggtttt 


3420 


ccatgcactc 


atccataata 


atqqcaatqq 

w 23 23 ^ 23 23 


qacccctqqc 

23 27 23 


qqccqcttta 

23 23 23 


gcaaacacgt 


3480 


tttqqqqqtt 


ggaaacatca 


tagttttgct 


ctagagtgag 


ctcatcatag 


gccatcttta 


3540 


caaaqcqqqq 


taqqaqqgtg 

»-«*33«*333 *-3 


cccqactggq 

3** w 333 


qgatgatagt 

23 23 23 ^ 


tccatctggg 


cctggagcgt 


3600 


agttgccctc 


acagatctgc 


atctcccagg 


ccttaatttc 


cgaggggggg 

*3 3333333 


atcatgtcca 


3660 


cctqqqqqqc 

v *333333* 


gataaaaaac 


acqgtttctg 
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aaagagccgc tgggtgcgcg cgtgggagcc gatcgggaag aagctgatct cctgccacca 6840 

gttggaggaa tgggtgttga tgtggtgaaa gtagaagtcc cgccggcgca cagagcattc 6900 

gtgctgatgt ttgtaaaagc gaccgcagta gtcgcagcgc tgcacgctct gtatctcctg 6960 

aatgagatgc gcttttcgcc cgcgcaccag aaaccggagg gggaagttga gacgggggct 7020 

tggtggggcg gcatcccctt cgccttggcg gtgggagtct gcgtctgcgc cctccttctc 7080 

tgggtggacg acggtgggga cgacgacgcc ccgggtgccg caagtccaga tctccgccac 7140 

ggaggggcgc aggcgttgca ggaggggacg cagctgcccg ctgtccaggg agtcgagggc 7200 

ggccgcgctg aggtcggcgg gaagcgtttg caagttcact ttcagaagac cggtaagagc 7260 

gtgagccagg tgcacatggt acttgatttc caggggggtg ttggaagagg cgtccacggc 7320 

gtagaggagg ccgtgtccgc gcggggccac caccgtgccc cgaggaggtt ttatctcact 7380 

cgtcgagggc gagcgccggg gggtagaggc ggctctgcgc cggggggcag cggaggcagt 7440 

ggcacgtttt cgtgaggatt cggcagcggt tgatgacgag cccggagact gctggcgtgg 7500 

gcgacgacgc ggcggttgag gtcctggatg tgccgtctct gcgtgaagac caccggcccc 7560 

cgggtcctga acctgaaaga gagttccaca gaatcaatgt ctgcatcgtt aacggcggcc 7620 

tgcctgagga tctcctgtac gtcgcccgag ttgtcttgat aggcgatctc ggccatgaac 7680 

tgctccactt cttcctcgcg gaggtcgccg tggcccgctc gctccacggt ggcggccagg 7740 

tcgttggaga tgcgacgcat gagttgagag aaggcgttga ggccgttctc gttccacacg 7800 

cggctgtaca ccacgtttcc gaaggagtcg cgcgctcgca tgaccacctg ggccacgttg 7860 

agttccacgt ggcgggcgaa gacggcgtag tttctgaggc gctggaagag gtagttgagc 7920 

gtggtggcga tgtgctcgca gacgaagaag tacatgatcc agcgccgcag ggtcatctcg 7980 

ttgatgtctc cgatggcttc gagacgctcc atggcctcgt agaagtcgac ggcgaagttg 8040 

aaaaattggg agttgcgggc ggccaccgtg agttcttctt gcaggaggcg gatgagatcg 8100 

gcgaccgtgt cgcgcacctc ctgctcgaaa gcgccccgag gcgcctctgc ttcttcctcc 8160 

ggctcctcct cttccagggg cacgggttcc tccggcagct ctgcgacggg gacggggcgg 8220 

cgacgtcgtc gtctgaccgg caggcggtcc acgaagcgct cgatcatttc gccgcgccgg 8280 

cgacgcatgg tctcggtgac ggcgcgtccg ttttcgcgag gtcgcagttc gaagacgccg 8340 

ccgcgcagag cgcccccgtg cagggagggt aagtggttag ggccgtcggg cagggacacg 8400 

gcgctgacga tgcattttat caattgctgc gtaggcactc cgtgcaggga tctgagaacg 8460 

tcgaggtcga cgggatccga gaacttctct aggaaagcgt ctatccaatc gcagtcgcaa 8520 

ggtaagctga ggacggtggg ccgctggggg gcgtccgcgg gcagttggga ggtgatgctg 8580 

ctgatgatgt aattaaagta ggcggtcttc aggcggcgga tggtggcgag gaggaccacg 8640 

tctttgggcc cggcctgttg aatgcgcagg cgctcggcca tgccccaggc ctcgctctga 8700 

cagcgacgca ggtctttgta gtagtcttgc atcagtctct ccaccggaac ctctgcttct 8760 

cccctgtctg ccatgcgagt cgagccgaac ccccgcaggg gctgcagcaa cgctaggtcg 8820 

gccacgaccc tctcggccag cacggcctgt tggatctgcg tgagggtggt ctggaagtcg 8880 
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wuy wll Ld\»y 


cccaaaaaca 

vLLnyyay v, w 


y Lyyyay Lyy 


aaaaaraatc 


wy w Ly Ly w ll 


y l Lyy l l l Ly 


12240 


ccacactaac 


atcatcaaaa 

y v_ v_ y L- v_ u. y w y 


aaaaaaccca 

nynnvjuvLLVj 


yuwyww Lwwy 


caaacaataa 

Luyuwyu Luy 


"taacatatta 

l wy Ly Ly l Ly 


12300 


aacctaaaaa 

y w w w *-y y y wy 


aaaacaaaac 
yy u y yy w y *— 


caaccccttt 

L. \A, \*K L» L. L V. L L L 


actcacttac 

y w l wu w l Ly w 


aacccaaaaa 

ww w w w wwy y y 


aca*t"tccaat: 

y wy l l w w w y l 


12360 


cacctctact 

w y ww l wu\> l 


aataaaaaaa 


acacaaaaac 

w l y v. y y uuu\> 


ttaccaaaac 

l lww WLAy wy w 


cataaccaca 

ww Lyy w w w wu 


y l y Ly Ly l ll 


12420 


tttcttcctc 

W W W V» W V %• V» W W> 


tctttcttcc 

w www v» w w v» w» 


tcaacacaac 

*-y y v^y v^y y v. 


agaatgagaa 


aaacaataaa 

y«*yLyy vy wy 


aatcacacca 

VA WJ W Ww WV Wj VJ Vjj Wj WJ 


12480 


QCQQcatata 


agggtccgcc 


cccttcttac 


aaaaacataa 

ywwwy wy Ly w 


taaaatcaac 
yyy Lwuyw 


Q3.t±CQt.QCCQ 

Vj W% W\ Wji WJ W WJ Wj Wj WJ 


12540 


gccacgctgg 


aggcgcctta 


cgttcctccc 


aaatacctaa 


aacctacaaa 

yj I4W WWVW^WJVf WL 


aaacaaaaac 

WJ WJ WJ WUVIUUU\^ 


12600 


aacatccatt 

w»j wu w w wy w l 


actccaaact 


yy vuvv.v*v uy 


"tacaatacca 

Lwwy W LUWWU 


ccaaaatiata 

l Lwwy y Ly lu 


cctaat'aaac 

ll vyy lu y w l 


12660 


aacaaatcaa 

w**wv*uy y y 


caaacatcac 

wy y uwu l *j wj 


ctccctaaat 

w v> \_ l l yuu L 


tatcaaaaca 

lu l www.ww wy 


atcacaacaa 

U LLULuy LUU 


ttttctaact 

L L L LL Ly UL L 


12720 


accataatac 


aaaacaataa 


cttcacccca 


acaaaaacaa 

awyyayy wyy 


acacacaaac 

yLULyLHVjUL 


ra1"taari-t* t* 

Lw L LwwL L L L 


12780 

-1. C t VJ L* 


aacaaacatt 


cccactaaaa 
LV,v *y '•yyyy 


caatcaacta 

««yy *»vuy v Ly 


aaaaccatcc 

%4UMM WIA WW^Wj 


tacacaccaa 

LyLWLwLLUU 


catacccaac 

LU Vy LLLWWL 


12840 


atcaacgagt 


tcatgtccac 


caacaagttc 


agggccaggc 


tgatggttaa 


aaaggctgaa 


12900 


aaccagcctc 


ccgagtacga 


atggtttgag 


ttcaccattc 


ccgagggcaa 


ctattccgag 


12960 


accatgacta 


tcgatctgat 


gaacaatgcg 


atcgtggaca 


attacctgca 


agtggggagg 


13020 
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cagaacgggg 


tattggaaag 


cgatatcggc 


gtaaaatttg 


ataccagaaa 


cttccgactg 


13080 


gggtgggatc 


ccgtgaccaa 


gctggtgatg 


ccaggcgtgt 


acaccaacga 


ggcttttcac 


13140 


cccgacatcg 


tgctgctgcc 


ggggtgcggt 


gtggacttca 


ctcagagccg 


tttgagtaac 


13200 


ctgttaggga 


tcagaaagcg 


ccgccccttc 


caagagggct 


ttcagatcat 


gtatgaggac 


13260 


ctggaaggag 


gtaacattcc 


aggtttgcta 


gacgtgccgg 


cgtatgaaga 


gagtgttaaa 


13320 


caggcggagg 


cgcagggacg 


agagattcga 


ggcgacacct 


ttgccacgga 


acctcacgaa 


13380 


ctggtaataa 


aacctctgga 


acaagacagt 


aaaaaacgga 


gttacaacat 


tatatccggc 


13440 


actatgaata 


ccttgtaccg 


gagctggttt 


ctggcttaca 


actacgggga 


tcccgaaaag 


13500 


ggagtgagat 


catggaccat 


actcaccacc 


acggacgtga 


cctgcggctc 


gcagcaagtg 


13560 


tactggtccc 


tgccggatat 


gatgcaagac 


ccggtcacct 


tccgcccctc 


cacccaagtc 


13620 


agcaacttcc 


cggtggtggg 


caccgagctg 


ctgcccgtcc 


atgccaagag 


cttctacaac 


13680 


gaacaggccg 


tctactcgca 


actcattcgc 


cagtccaccg 


cgcttaccca 


cgtgttcaat 


13740 


cgctttcccg 


agaaccagat 


tctggtgcgc 


cctcccgctc 


ctaccattac 


caccgtcagt 


13800 


gaaaacgttc 


ccgccctcac 


agatcacgga 


accctgccgc 


tgcgcagcag 


tatcagtgga 


13860 


gttcagcgcg 


tgaccatcac 


cgacgccaga 


cgtcgaacct 


gtccctacgt 


ttacaaagct 


13920 


cttggcgtag 


tggctcctaa 


agtgctctct 


agtcgcacct 


tctaaacatg 


tccatcctca 


13980 


tctctcccga 


taacaacacc 


ggctggggac 


tgggctccgg 


caagatgtac 


ggcggagcca 


14040 


aaaggcgctc 


cagtcagcac 


ccagttcgag 


ttcggggcca 


cttccgtgct 


ccctggggag 


14100 


cttacaagcg 


aggactctcg 


ggccgaacgg 


cggtagacga 


taccatagat 


gccgtgattg 


14160 


ccgacgcccg 


ccggtacaac 


cccggaccgg 


tcgctagcgc 


cgcctccacc 


gtggattccg 


14220 


tgatcgacag 


cgtggtagct 


ggcgctcggg 


cctatgctcg 


ccgcaagagg 


cggctgcatc 


14280 


ggagacgtcg 


ccccaccgcc 


gccatgctgg 


cagccagggc 


cgtgctgagg 


cgggcccgga 


14340 


gggtaggcag 


aagggctatg 


cgccgcgctg 


ccgccaacgc 


cgccgccggg 


agggcccgcc 


14400 


gacaggctgc 


ccgccaggct 


gctgccgcca 


tcgctagcat 


ggccagaccc 


aggagaggga 


14460 


acgtgtactg 


ggtgcgcgat 


tctgtgacgg 


gagtccgagt 


gccggtgcgc 


agccgacctc 


14520 


cccgaagtta 


gaagatccaa 


gctgcgaaga 


cggcggtact 


gagtctccct 


gttgttatca 


14580 


gcccaacatg 


agcaagcgca 


agtttaaaga 


agaactgctg 


cagacgctgg 


tgcctgagat 


14640 


ctatggccct 


ccggacgtga 


agcctgacat 


taagccccgc 


gatatcaagc 


gtgttaaaaa 


14700 


gcgggaaaag 


aaagaggaac 


tcgcggtggt 


agacgatggc 


ggagtggaat 


ttattaggag 


14760 


tttcgccccg 


cgacgcaggg 


ttcaatggaa 


agggcggcgg 


gtacaacgcg 


ttttgaggcc 


14820 


gggcaccgcg 


gtagttttta 


ccccgggaga 


gcggtcggcc 


gttaggggtt 


tcaaaaggca 


14880 


gtacgacgag 


gtgtacggcg 


acgaggacat 


attggaacag 


gcggctcaac 


agatcggaga 


14940 


atttgcctac 


ggaaagcgtt 


cgcgtcgcga 


agacctggcc 


atcgctttag 


acagcggcaa 


15000 


ccccacgccc 


agcctcaaac 


ctgtgacgct 


gcagcaggtg 


ctccccgtga 


gcgccagcac 


15060 
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qqacaqcaaq 

3 3 ** ^**3 **** 3 


aggggaataa 


aaagagaaat 


ggaagatctg 


cagcccacca 


tccagctcat 


15120 


aatccctaaa 


cqqcaqaqqc 

*-3 3 ^***3 **3 3 N - 


tqqaaqaqqt 


cctqqaqaaa 


atqaaaqtqq 


acccaagcat 


15180 


agagccggac 


gtcaaagtca 


ggccgatcaa 


agaagtggcc 


cctggtctcg 


qqgtqcagac 


15240 


ggtggatatc 

33 33 


cagatccccg 


tcacgtcagc 


ttcgaccgcc 


qtggaagcca 


tggaaacgca 


15300 


aacggaaacc 


cctgccgcga 


tcggtaccag 


ggaagtggcg 


ttgcaaaccg 


acccctggta 


15360 


cgaatacgcc 


gcccctcggc 


gtcagaggcg 


acccgctcgt 


tacggccccg 


ccaacgccat 


15420 


catgccagaa 


tatqcqctqc 


atccgtctat 


cctgcccacc 


cccggctacc 


qqqqaqtqac 


15480 


qtatcqcccq 


tcaqqaaccc 


gccgccgaac 


ccgtcgccgc 


cgccgctccc 


gtcgtgctct 


15540 


qqcccccqtq 


tcqqtqcqcc 


qcqtaacacq 

3 3 3 


ccqqqqaaaq 

vv, 3333 uuu 3 


acagttacca 


ttcccaaccc 


15600 


gcgctaccac 


cctagcatcc 


tttaatgact 


ctgccgtttt 


qcaqatqqct 


ctgacttgcc 


15660 


qcqtqcqcct 


tcccgttccg 


cactatcgag 


gaagatctcg 


tcqtaqqaqa 

w *"*3 33 3 


qqcatqqcaq 

33^" ^33^33 


15720 


qtagtggtcq 


ccggcgggct 


ttgcqcaqqc 

u 3'-3 vvl 33 


gcatgaaagg 


cggaatttta 


cccgctctga 


15780 


tacccataat 


cgccgccgcc 


atcggtgcca 


tacccggcgt 


cqcttcaqtq 


gccttgcaag 


15840 


cagctcgtaa 


taaataaacg 


aaggcttttg 


cacttatgtc 


ctqatcctqa 
3 3 ^ w 3^^ 


ctattttatg 


15900 


cagaaagagc 


atggaagaca 


tcaattttac 


atcactaact 

3 tv -y L 33 L *- 


ccacaacaca 

3 33 3 


Qctcacaacc 

j3 **■ *■ **• V* w %^ v_ 


15960 


qctcatqqqc 


acctqqaacq 


acatcggcac 


cagtcagctc 


aacqqqqqcq 

uu%, 33333^3 


ctttcaattg 


16020 


ggggagcctt 


tqqaqcqqca 


ttaaaaactt 


tggctccacg 


attaaatcct 


acggcagcaa 


16080 


agcctggaac 


agtagtgctg 

3 3 ^ 3 3 


gtcagatgct 


ccgagataaa 


ctgaaggaca 


ccaacttcca 


16140 


agaaaaagtg 

3 3 


gtcaatgggg 

3 3 3 3 3 


tggtgaccgg 

— 33 ^3 33 


catccacggc 


gcgqtaqatc 

3 33 w 3 «» ^ 


tcgccaacca 


16200 


agcggtgcag 

ZJ ^ "~ 3 3 


aaagagattg 


acaggcgttt 


ggaaagctcg 


cgggtgccgc 

333 w 3 ^" w 3 w 


cqcaqaqaqq 

3^" w *3 w *'3 Wfc 33 


16260 


ggatqaqqtq 


gaqqtcqaqq 

3 33 3** 3 3 


aagtagaagt 


agaggaaaag 


ctqcccccac 
™ *■ 3 3 


tggagaaagt 


16320 


tcccggtgcg 

33 3 3 


cctccgagac 


cqcaqaaqcq 

3 3 **** 3 *»3 


acccaggcca 


gaactagaag 


aaactctggt 


16380 


qacqqaqaqc 

3*^^*3 3 3 Z5 


aaggagcctc 


cctcgtacga 


qcaaaccttq 


aaaqaqqqcq 


cctctccacc 


16440 


ctacccaatg 


acaaaaccga 


tcgcgcctat 


qqctcqqccq 

33 v w> "3 3 3 


qtqtacqqqa 


aggactacaa 


16500 


gcctgtcacg 


ctagagctcc 


ccccgccgcc 


accgccgccc 


cccacgcgcc 


cgaccgttcc 


16560 


cccccccctg 


ccggctccgt 


cggcgggacc 


cgtgtccgca 


cccqtcqccq 


tgcctctgcc 


16620 


agccgcccgc 


ccaqtqqccq 

3 33 3 


tggccactqc 


cagaaacccc 


aqaqqccaqa 


qaqqaqccaa 

3 u g3 3 


16680 


ctggcaaagc 


acgctgaaca 


qcatcqtqqq 

3 w "-3 w 333 


cctqqqaqtq 

*"33 3 3 3 


aaaagcctga 


aacgccgccg 


16740 


ttgctattat 


taaaagtgta 


gctaaaaaat 


ttcccgttgt 


atacqcctcc 


tatattacca 


16800 


ccagagacgc 


gtgactgtcg 

3 3 3 *• ^-3 


ccgcgagcgc 


cgctttcaag 


atqqccaccc 


catcqataat 


16860 


gccgcagtgg 

3 3^~~3 33 


tcttacatgc 


acatcgccgg 


gcaqqacqcc 


tcqqaqtacc 


tqaaccccqq 


16920 


tctcgtgcag 


ttcgcccgcg 


ccaccgacac 


ctacttcagc 


ttqqqaaaca 


aatttaaaaa 


16980 


ccccaccgtg 


gcccccaccc 


acgatgtaac 


cacggaccgc 


tcgcaaaggc 


tgaccctgcg 


17040 


ttttgtgccc 


gtagaccggg 


aggacaccgc 


gtactcttac 


aaagtgcgct 


acacgctggc 


17100 


cgtaggggac 


aaccgagtgc 


tggacatggc 


cagcacctac tttgacatcc 
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ggatcgcggt 


cccagtttta 


agccctactc 


gggtaccgcg 


tacaattccc 


tggctcccaa 


17220 


gggcgctccc 


aaccctgcag 


aatggacgaa 


ttcagacagc 


aaagttaaag 


tgagggcaca 


17280 


ggcgcctttt 


gttagctcgt 


atggtgctac 


agcgattaca 


aaagagggta 


ttcaggtggg 


17340 


agtaacctta 


acagactccg 


gatcaacacc 


acagtatgca 


gataaaacgt 


atcagcctga 


17400 


gccgcaaatt 


ggagaactac 


agtggaacag 


cgatgttgga 


accgatgaca 


aaatagcagg 


17460 


aagagtgcta 


aagaaaacaa 


cgcccatgtt 


cccttgttac 


ggctcatatg 


ccaggcccac 


17520 


taatgaaaaa 


ggaggacagg 


caacaccgtc 


cgctagtcaa 


gacgtgcaaa 


atcccgaatt 


17580 


acaatttttt 


gcctctacta 


atgtcgccaa 


tacaccaaaa 


gcagttctat 


atgcggagga 


17640 


cgtgtcaatt 


gaagcgccag 


acactcactt 


ggtgttcaaa 


ccaacagtca 


ctgaaggcat 


17700 


tacaagttca 


gaggctctac 


tgacccaaca 


agctgctccc 


aaccgtccaa 


actacatagc 


17760 


ctttagagat 


aattttattg 


gtctcatgta 


ctacaatagc 


acaggtaaca 


tgggagtact 


17820 


ggcaggccag 


gcttctcagc 


taaatgcagt 


tgttgacctg 


caagacagaa 


atactgagct 


17880 


gtcctaccaa 


ctcatgttgg 


acgccctcgg 


agaccgcagt 


cggtactttt 


ctatgtggaa 


17940 


ccaagctgtg 


gatagttacg 


atcctgatgt 


aagaatcata 


gaaaaccatg 


gcgtagaaga 


18000 


tgaattgcct 


aattattgct 


ttcctttggg 


aggcatggca 


gtaaccgaca 


cctactcgcc 


18060 


tataaaggtt 


aatggaggag 


gcaatggatg 


ggaagccaat 


aacggcgttt 


tcaccgaaag 


18120 


aggagtggaa 


ataggttcag 


ggaacatgtt 


tgccatggag 


attaacctgc 


aagccaacct 


18180 


atggcgtagc 


tttctgtact 


ccaatattgg 


gctgtacctg 


ccagactctc 


tcaaaatcac 


18240 


tcctgacaac 


atcacactcc 


cagagaacaa 


aaacacctat 


cagtatatga 


acggtcgcgt 


18300 


gacgccaccc 


gggctggttg 


acacctacgt 


taacgtgggc 


gcgcgctggt 


cccccgatgt 


18360 


catggacagt 


attaaccctt 


ttaatcacca 


ccgcaacgcc 


ggactccgct 


accgttccat 


18420 


gctcctggga 


aacggacgct 


acgtgccctt 


ccacatccag 


gtgccccaga 


aattctttgc 


18480 


aattaaaaac 


ctgctgctgc 


tccccggttc 


ctacacctac 


gagtggaact 


tccgcaagga 


18540 


cgtgaacatg 


atcttgcaga 


gctcgctggg 


caatgacctg 


cgagtggacg 


gggccagcat 


18600 


ccgcttcgac 


agcatcaacc 


tgtacgccaa 


ctttttcccc 


atggcccaca 


acacggcctc 


18660 


caccctggaa 


gccatgctgc 


gcaacgacac 


caacgaccaa 


tctttcaacg 


actacctgtg 


18720 


cgcggccaac 


atgctgtacc 


ccatccccgc 


caacgccacc 


agcgtgccca 


tctccattcc 


18780 


ctctcgcaac 


tgggcagcct 


tcaggggctg 


gagtttcacc 


cgcctcaaaa 


ccaaggagac 


18840 


cccctcgctg 


ggctccgggt 


tcgaccccta 


cttcgtctac 


tccggctcca 


tcccctacct 


18900 


ggacggcacc 


ttctacctca 


accatacttt 


caaaaaggtg 


tcaatcatgt 


tcgactcctc 


18960 


cgtcagctgg 


cccggcaacg 


accgtctgct 


gacgcccaac 


gagttcgaaa 


tcaagcgttc 


19020 


ggtggacggt 


gaagggtaca 


acgtggctca 


gagcaacatg 


accaaggact 


ggttcctgat 


19080 


tcagatgctc 


agccactaca 


acatcggcta 


ccagggcttc 


tacgtgcccg 


aaaattacaa 


19140 


ggaccgcatg 


tactctttct 


tcagaaactt 


ccaacccatg 


agccgccaaa 


ttgtagattc 


19200 
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aacaacttac 


actaattatc 


aggatgtgaa 


actgccatac 


cagcataaca 


actcagggtt 


19260 


cataaactac 


atqaqaccca 


ccatgcgaga 


gqqqcaggcc 

ZJZ3z3zj^* iJZJ^~^' 


tacccggcca 


actatcccta 


19320 


tcccctgatt 


qqqqccaccq 

23 23 «3 T3 23 


ccgtgcccag 


cctcacgcag 


aaaaagttcc 


tctgcgaccg 


19380 


qqtqatqtqq 


aggatcccct 


tctctagcaa 


cttcatgtct 


atgggctccc 


tcaccgacct 


19440 


qqqqcaqaac 


atgctgtacg 


ccaactccgc 


tcacgccttg 


gatatgacct 


ttgaggtgga 

mj mj mj 


19500 


tcccatggat 


gagcccacgc 


ttctctatgt 


tctgtttgaa 


gtcttcgacg 


tggtgcgcat 


19560 


ccaccagccg 


caccqcqqcq 


tcatcgaggc 


cgtctacctg 


cgcacacctt 


tctctgccgg 


19620 


taacgccacc 


acctaaagaa 


qccgatgggc 

3 3 3 33 


tccagcgaac 


aggagctgca 


ggccattgtt 


19680 


cqcgacctgg 


gctgcgqgcc 


ctactttttg 


ggcaccttcg 


acaagcgttt 


tcccggcttc 


19740 


atgtcccccc 


acaagccggc 


ctgtgccatc 


gttaacacgg 


ccggacggga 

ZJ ZJ ZJ ZJ Zf 


gaccgggggg 

9 ^ ~J ZJ mJ ZJ 


19800 


qtccactqqc 

23 *■ -w^^ w 23 23 


tcgccttcgc 


ctggaacccg 


cgtaaccgca 


cctgctacct 


gttcgaccct 


19860 


tttggtttct 


ccgacgaaag 


gctgaagcag 


atctaccagt 


tcgagtacga 


ggggctcctc 

ZJ ZJ ZJ ZJ 


19920 


aaqcqcaqcq 


ctctggcctc 


cacgcccgac 


cactgcgtca 


ccctggaaaa 


gtccacccaa 


19980 


acqqtccaqq 


ggcccctctc 


qqccgcctqc 

33 3 ^ ZJ 


gggctcttct 


gttgcatgtt 


tttgcacgcc 


20040 


ttcatqcact 


qacctcacac 


ccccatggat 


cacaacccca 


ccatggatct 


gctcaccgga 


20100 


qtqcccaaca 


gcatgcttca 


cagcccccag 


gtcgccccca 


ccctgcgccg 


taaccaggaa 


20160 


cacctgtatc 


gctttctggg 


gaaacactct 


gcctattttc 


gccgccaccg 


gcagcgcatc 


20220 


gaacgggcca 


cggccttcga 


aagcatgagc 


caaagagtgt 


aatcaataaa 


aaacattttt 


20280 


atttgacatg 


atacgcgctt 


ctggcgtttt 


attaaaaatc 


gaagggttcg 

3 3 3 3 3 


agggaggggt 


20340 


cctcgtgccc 


gctggggagg 


gacacgttgc 


gatactggaa 


acgggcgctc 

ZJ ZJ ZJ ZJ 


caacgaaact 


20400 


cgqggatcac 


cagccgcggc 


agqqgcacqt 


cttctaggtt 


ctgcttccaa 


aactgccgca 


20460 


ccagctgcag 


ggctcccatg 


acgtcgggcg 

3 w, "333*"3 


ccgatatctt 


gaagtcgcag 

3 3 -—~ZJ^~ —ZJ 


ttagggccgg 


20520 


agctcccgcg 


gctgttgcgg 


aacacggggt 


tggcacactg 


gaacaccagc 


acgccggggt 

ZJ ZJ ZJ ZJ ZJ 


20580 


tgtggatact 


ggccagggcc 


gtcgggtcgg 

3 333 33 


tcacctccga 


cgcatccaga 


tcctcggcgt 


20640 


tgctcagggc 

~3 333 


aaacggggtc 


agcttgcaca 


tctgccgccc 


aatctggggt 


actaggtcgc 


20700 


gcttqttqaq 

~7 I? 3 3 


gcagtcgcag 


cgcagaggga 


tcaggatgcg 

ZJ ZJ 3 3 


tcgctgcccg 


cgttgcatga 


20760 


tagggtaact 


cgccgccagg 


aactcctcca 


tttgacggaa 


ggccatctgg 


gctttgccgc 


20820 


cctcqqtqta 


gaatagcccg 


caggacttgc 


tagagaatac 


gttatgaccg 


cagttgacgt 


20880 


cctccgcgca 


qcaqcqqqcq 


tcttcgttct 


tcagctgaac 


cacgttgcgg 


ccccaacggt 


20940 


tctggaccac 


cttggctcta 


gtggggtgct 

3 ZJ ZJ ZJ ZJ ~ 


ccttcagcgc 


ccgctgtccg 


ttctcgctgg 


21000 


ttacatccat 


ttccaacacg 


tgctccttgc 


agaccatctc 


cactccgtgg 


aagcaaaaca 


21060 


ggacgccctc 


ctgctgggta 

^*3 3 3 3 


ctgcgatgct 


cccatacggc 


gcatccggtg 

3 3 3 3 


ggctcccagc 


21120 


tct tgtgt tt 


~> r~ t~ f /*i r~ r% 

tacccccgcg 


xaggcLtcca 


tgtaagccax 


aaggaatctg 


cccatcagct 




cggtgaaggt 


cttctggttg 


gtgaaggtta 


gcggcaggcc 


gcggtgctcc 


tcgttcaacc 


21240 


aagtttgaca 


gatcttgcgg 


tacaccgctc 


cctggtcggg 


cagaaactta 


aaagccgctc 


21300 
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tgctgtcgtt 


gtctacgtgg 


aacttctcca 


ttaacatcat 


catggtttcc 


atacccttct 


21360 


cccacgctgt 


caccagtggt 


ttgctgtcgg 


ggttcttcac 


caacacggcg 


gtagaggggc 


21420 


cctcgccggc 


cccgacgtcc 


ttcatggtca 


ttctttgaaa 


ctccacggag 


ccgtccgcgc 


21480 


gacgtactct 


gcgcaccgga 


gggtagctga 


agcccacctc 


caccacggtg 


ccttcgccct 


21540 


cgctgtcgga 


gacaatctcc 


ggggatggcg 


gcggcgcggg 


tgtcgccttg 


cgagccttct 


21600 


tcttgggagg 


gagctgaggc 


gcctcctgct 


cgcgctcggg 


gctcatctcc 


cgcaagtagg 


21660 


gggtaatgga 


gctgcctgct 


tggttctgac 


ggttggccat 


tgtatcctag 


gcagaaagac 


21720 


atggagctta 


tgcgcgagga 


aactttaacc 


gccccgtccc 


ccgtcagcga 


cgaagatgtc 


21780 


atcgtcgaac 


aggacccggg 


ctacgttacg 


ccgcccgagg 


atctggaggg 


gcctgaccgg 


21840 


cgcgacgcta 


gtgagcggca 


ggaaaatgag 


aaagaggagg 


cctgctacct 


cctggaaggc 


21900 


gacgttttgc 


taaagcattt 


cgccaggcag 


agcaccatag 


ttaaggaggc 


cttgcaagac 


21960 


cgctccgagg 


tgcccttgga 


cgtcgccgcg 


ctctcccagg 


cctacgaggc 


gaaccttttc 


22020 


tcgcctcgag 


tgcctccgaa 


gagacagccc 


aacggcacct 


gcgagcccaa 


cccgcgactc 


22080 


aacttctacc 


ccgtgttcgc 


cgtaccagag 


gcgctggcca 


cctatcacat 


ttttttcaaa 


22140 


aaccaacgca 


tccccctatc 


gtgccgggcc 


aaccgcaccg 


cggccgatag 


gaatctcagg 


22200 


cttaaaaacg 


gagccaacat 


acctgatatc 


acgtcgctgg 


aggaagtgcc 


caagattttc 


22260 


gagggtctgg 


gtcgagatga 


gaagcgggcg 


gcgaacgctc 


tgcagaaaga 


acagaaagag 


22320 


agtcagaacg 


tgctggtgga 


gctggagggg 


gacaacgcgc 


gtctggccgt 


cctcaaacgc 


22380 


tgcatagaag 


tctcccactt 


cgcctacccc 


gccctcaact 


tgccacccaa 


agttatgaaa 


22440 


tcggtcatgg 


atcagctgct 


catcaagaga 


gctgagcccc 


tggatcccga 


ccaccccgag 


22500 


gcggaaaact 


cagaggacgg 


aaagcccgtc 


gtcagcgacg 


aggagctcga 


gcggtggctg 


22560 


gaaaccaggg 


acccccaaca 


gttgcaagag 


aggcgcaaga 


tgatgatggc 


ggccgtgctg 


22620 


gtcaccgtgg 


agctggaatg 


cctgcaacgg 


tttttcagcg 


acgtggagac 


gctacgcaaa 


22680 


atcggggaat 


ccctgcacta 


caccttccgc 


cagggctacg 


tccgccaggc 


ctgcaagatc 


22740 


tccaacgtgg 


agctcagcaa 


cctggtctcc 


tacatgggca 


tcctccacga 


gaaccggctg 


22800 


gggcagagcg 


tgctgcactg 


caccttgcaa 


ggcgaggcgc 


ggcgggacta 


cgtgcgagac 


22860 


tgcatctacc 


tcttcctcac 


cctcacctgg 


cagaccgcca 


tgggcgtctg 


gcagcagtgc 


22920 


ttggaagaga 


gaaacctcaa 


agagctagac 


aaactcctct 


gccgccagcg 


gcgcgccctg 


22980 


tggtccggtt 


tcagcgagcg 


cacggtcgcc 


agcgctctgg 


cggacatcat 


cttcccggag 


23040 


cgcctgatga 


aaaccttgca 


aaacggcctg 


ccggatttca 


tcagtcaaag 


cattttgcaa 


23100 


aacttccgct 


cttttgtcct 


ggaacgctcc 


gggatcttgc 


ccgccatgag 


ctgcgcgcta 


23160 


ccttctgact 


ttgtccccct 


ctcctaccgc 


gagtgccctc 


ccccactgtg 


gagccactgc 


23220 


tacctcttcc 


aactggccaa 


ctttctggcc 


taccactccg 


acctcatgga 


agacgtaagc 


23280 


ggagagggtt 


tactggagtg 


ccactgccgc 


tgcaacctgt 


gcacccccca 


cagatcgctg 


23340 
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orri~ocaara 

y v_ v_ u y v,uuL.a 




caacaaaacc 


caaatcataa 


ataccttcaa 


aatccaaaaa 

y a uu.v.ayyyy 


23400 


u_\_^u»ciyu— ciyv» 


aaaaaoaTac 

CLa.ya.yyy ty v- 


ttccaactta 


aaactcactc 

uwy v» u.u>m%» v x» 


caacactata 

yy^y y y 


gacctcggct 


23460 




aatttataac 

uau*. u. v. v. y v. uy Vp 


caaaaactac 


cacacccaca 


aaattcagtt 


ttacgaagac 


23520 


caatctcaac 

\»vlu vu- v v>y uv. 


caccaaaaac 

^— *-A ^— W ^3 ^A*A\A 23 Vp 


ccccctcaca 


acctacatca 


tcacccagag 


caagatcctg 


23580 


acccaattac 


aatccatcaa 


ccaaacacac 


cgcgatttcc 


ttttgaaaaa 


qqqtcqqqqq 

yyy *- w yyyyy 


23640 


y vy uav.v tyy 


acccccaaac 

W% W* ^# Wj VUMVL V* 


caacaaaaaa 


ctcaacccgt 


ccacactctc 


cgtcgaagca 


23700 


QCCCCCCCQcl 


QZCa'tQCCQC 
y ** *-y *•* ^* y 


ccaaaaaaac 


caccaaacaa 


ctgatcgctc 


aacaaaaaac 

yy y y y 


23760 


aaaaaaacaa 


QciQCtQCtCC 
y u y u» v y v» u. vp. v» 


aacaacaaat 
y y yy *- 


aaaaaacaaa 
yy w yy uv *y u y 


gaagagatgt 


gggacagcca 


23820 


aacaaaoaaa 


atatcaaaaa 

y **y ^ v»**ytiyH 


acaaaaaaaa 
y yy yy 


aataaaaaac 


tgggacagcc 


tagacgagga 


23880 


aaaaaacGaa 


ctttcaaaaa 

v» u. u. L^uyuyy 


aaaaaacaac 

uu y u yy y **^" 


caaaaaaaaa 


ccacctacat 


ccaacacacc 


23940 


ttctctaaac 

v u» v. ^- v_ vy u.y v* 


caacaaccaa 

vy v*. w*. u*j v» Vp**jvp» 


aaccccaacc 


cccaacaccc 


ccaaccaact 


cactcaaaac 


24000 


caaccataaa 

*— **y *- u-y vuyy 


taaaacacca 

uy y y ua v» y u> v»ua 


ccaaatctcc 


aacaacaaca 

»»y y v.uy v.y 


acaacaacaa 

y »- ui *u. y y y 


caaataaaac 


24060 


caaacacaaa 


caacaaaaat 
^yy^yyyyy 


attactccta 


acaaacccac 
y yyy v«t* *.» 


aaaaacaata 

MM *A%A 23 ^**A 23 


ttataaacta 


24120 


cttacaacac 

Vp. U> tM^WM^UVa 


y y y yy y 




Ly\>\>v.yuv.y\. 


tacctcctct 

WUl N— U_ U. V_ V_ UiV Up 


"tccatcacaa 

Up Xpi u* upuAVcy y 


24180 


*- y <»yy u. lv. 


cctcacaaca 

v* v. vu_y v« uuv.y 


ttctctatta 


u> w u. Vj>r v>y i- V.U. v. 


ctrtacaacc 

Vp> Up Vp vuvpuy u>Ui 


cctacaaaac 

u> u> uuu.y ujuujuupU v» 


24240 


actcaaaaaa 

y *■ y y ***y ****** 


aaaaactaaa 


acctcctccb 


ccacaaaaaa 

v» v» y ~y *>*y y ***** 


aaactccacc 

UUU\> WU>U>M >• *p) 


accactacca 

yv.v.y v. u.y u- v»y 


24300 


ccaccaaaaa 


tccaccaacc 

W V* V* WA \v V* Vf m W* V*« 


accaaaaaac 


taaaaaaaca 


catctttccc 


actctatata 

IAN* w w 23 w ^a w 23 


24360 


ctatctttca 

V* w vA W V- WWW V# WA 


acaaaaccac 


aaacaacacc 

Vj *j v»* w»v V* 


ctcaacacaa 


actaaaaata 

*A W VJ VA \A VA ^A W ^A 


aaaaaccact 


24420 


ccttccactc 


actcacccac 

y v-» u» WiUv w v>y v> 


aactatctot 


accacaaaaa 


aaaaaaccaa 


ctacaacaca 


24480 


ccctaaacaa 


caccaaaaca 

Vp> *"J W «pJ UpV UpV «pJ VaV*. 


ctattxaaca 

ui u^jj u» uv»*y u»f»* 


aatactactc 

^A %A W VA V* W^4^v W V* 


aacatctctt 


aaaaactaaa 

****t*M*AW W *A VA \A 


24540 


aaacccacac 


tttttccccc 

Up U> U» Up V v» v» v» v» v» 


tcaaccacca 
yy y 


aaacccacat 


catcaccaac 

u. v»y u>\puyj v. 


ataaacaaaa 

ua u. y **y *■»*■*** y y 


24600 


aaattcccac 


v_ v. u* v. l. ci \_ u i_ y 


yy y 


aaccrraaat: 


yyy yy ^ 




24660 


cccaaaacta 

v» v» v* ua y y u* v* ku 


ctccaacaaa 

v» v. v« v» v*. vj uuuy 


ataaactaac 

*>*j uuui **y y 


*•*—*» y ** y *— *—y y 


cccccacata 

wuivuiviuiuiu y 


atctcacaaa 

UA U» Up. Up Vp VpV *p»y **y 


24720 


ttaacaacat 


ccaaacccac 


caaaaccaaa 


ttctcttaaa 

u> u» v« u. buyu 


acaaacaaca 

**u.uiyjyju»yjy v» v*. 


atcaccacca 

Up* Up V. U. V. \— \£ W X» U* 


24780 


caccccggcg 


ccaactcaac 


ccacctaatt 


aacccaccac 


ccaaatatat 

v»v*%pA>2323 23 


caaaaaaatc 


24840 


cccgcccgac 


cacaotcctc 


ctaccacaca 


acacaaaaac 

y yy yy 


caaaatcctc 


ataactaact 

** Up y UpU Vpi UpUpVUpU 1p W 


24900 


ctoaaataca 

v» *-y y y y u.sav»ua 


attaacaaac 


aaatccaaat 

yyy u*»u.uy y u> 


acaccaaa"ta 

uuiyvvuyy wu 


*-c*y**yy Luyy 


QCCQCtLCC't't 

y Up> Vp>y Up« u> Up> v> u> Up 


24960 


«— u. v. Lt.\,^yy 


aaatataaaa 

yuy lu v.uuu.y 


aoaa1*aa"tra 


I* Luy **y y *"»*^y 


t*yy u a. i.*«pV»Cly 


rt"f"aarnaro 


25020 

Cm J \J Cm \J 


aaacaataaa 


ctcctcaacc 


ggtctcagac 


ctgacggagt 


*»» u. uuvaij v. u, u» 


yyayyayu^yy 


25080 


accactcttc 


cttcaccact 


cgccaggcct 


acctgaccct 


yuayayv u. v- u> 


Up u— v_ t, Vp y v» v*. y Up. 


25140 


cqcactccaa 


aaaaatcaac 

y yy**** L ^yy *- 


actctccagt ( tcgtggaaga 


attcottccc 


tccatctact 

U. Xpt U_y Up Vp kUU> Up 


25200 


tcaacccctt 


ctccggctcg 


cctggacgct acccggacgc 


cttcattccc 

Up U» U. VpUpV Up U»Up>Up>U» 


aactttaaca 

uaupUVpi Up Up u> y ua Vp y 


25260 


cagtgagtga 


atccgtggac 


ggctacgact gatgacagat 


ggtgcggccg 


tgagagctcg 


25320 


gctgcgacat 


ctgcatcact 


gccgtcagcc tcgctgctac 


gctcgggagg 


cgatcgtctt 


25380 


cagctacttt 


gagctgccgg 


acgagcaccc tcagggtccg 


gctcacgggt 


tgaaactcga 


25440 
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gatcgagaac 


gcgctcgagt 


ctcgcctcat 


cgacaccttc 


accgcccgac 


ctctcctggt 


25500 


agaaatccaa 


cgggggatca 


ctaccatcac 


cctgttctgc 


atctgcccca 


cgcccggatt 


25560 


acatgaagat 


ctgtgttgtc 


atctttgcgc 


tcagtttaat 


aaaaactgaa 


ctttttgccg 


25620 


caccttcaac 


gccatctgtg 


atttctacaa 


caaaaagttc 


ttctggcaaa 


ggtacacaaa 


25680 


ctgtatttta 


ttctaattct 


acctcatcta 


tcgtgctgaa 


ctgcgcctgc 


actaacgaac 


25740 


ttatccagtg 


gattgcaaac 


ggtagtgtgt 


gcaagtactt 


ttgggggaac 


gatatagtta 


25800 


gtagaaataa 


cagcctttgc 


gagcactgca 


actcctccac 


actaatcctt 


tatcccccat 


25860 


ttgttactgg 


atggtatatg 


tgcgttggct 


ccggtttaaa 


tcctagttgc 


tttcataagt 


25920 


ggtttctaca 


aaaagagacc 


cttcccaaca 


attctgtttc 


ttttttcgcc 


ctatcctact 


25980 


gctgttctcc 


ctctggttac 


tctttcaaac 


ctctaattgg 


tattttagct 


ttgatactca 


26040 


taatctttat 


taactttata 


ataattaaca 


acttacagta 


aacatgcttg 


ttctactgct 


26100 


cgccacatct 


ttcgctctct 


ctcacgccag 


aacaagtatt 


gttggcgcag 


gttacaatgc 


26160 


aactcttcaa 


tctgcttaca 


tgccagattc 


cgaccagata 


ccccatatta 


cgtggtactt 


26220 


acaaacctcc 


aaacctaatt 


cttcatttta 


tgaaggaaac 


aaactctgcg 


atgactccga 


26280 


caacagaacg 


cacacatttc 


cccacccttc 


actacaattc 


gaatgcgtaa 


acaaaagctt 


26340 


gaagctttac 


aacttaaagc 


cttcagattc 


tggcttgtac 


catgctgtag 


ttgaaaaaag 


26400 


taatttagaa 


gtccacagtg 


attacattga 


attgacggtt 


gtggacctgc 


cacctccaaa 


26460 


atgtgaggtt 


tcctcctctt 


accttgaagt 


tcaaggcgtg 


gatgcctact 


gcctcataca 


26520 


cattaactgc 


agcaactcta 


aatatccagc 


tagaatttac 


tataatggac 


aggaaagtaa 


26580 


tcttttttat 


tatttaacaa 


caagcgctgg 


taacggtaaa 


cagttacctg 


actattttac 


26640 


tgctgttgtt 


gaattttcca 


cctacagaga 


aacgtatgcc 


aagcggcctt 


acaatttctc 


26700 


atacccgttt 


aacgaccttt 


gcaatgaaat 


acaagcgctc 


gaaactggaa 


ctgattttac 


26760 


tccaattttc 


attgctgcca 


ttgttgtaag 


cttaattacc 


attattgtca 


gcctagcatt 


26820 


ttactgcttt 


tacaagccca 


aaaaccctaa 


gtttgaaaaa 


cttaaactaa 


aacctgtcat 


26880 


tcaacaagtg 


tgattttgtt 


ttccagcatg 


gtagctgcat 


ttctacttct 


cctctgtcta 


26940 


cccatcattt 


tcgtctcttc 


aactttcgcc 


gcagtttccc 


acctggaacc 


agagtgccta 


27000 


ccgccttttg 


acgtgtatct 


gattctcacc 


tttgtttgtt 


gtatatccat 


ttgcagtata 


27060 


gcctgctttt 


ttataacaat 


ctttcaagcc 


gccgactatt 


tttacgtgcg 


aattgcttac 


27120 


tttagacacc 


atcctgaata 


cagaaatcaa 


aacgttgcct 


ccttactttg 


tttggcatga 


27180 


ttaagttatt 


gctgatactt 


aattatttac 


ccctaatcaa 


ctgtaattgt 


ccattcacca 


27240 


aaccctggtc 


attctacacc 


tgttatgata 


aaatccccga 


cactcctgtt 


gcttggcttt 


27300 


acgcagccac 


cgccgctttg 


gtatttatat 


ctacttgcct 


tggagtaaaa 


ttgtatttta 


27360 


ttttacacac 


tgggtggcta 


catcccagag 


aagatttacc 


tagatatcct 


cttgtaaacg 


27420 


cttttcaatt 


acagcctctg 


cctcctcctg 


atcttcttcc 


tcgagctccc 


tctattgtga 


27480 
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actactttca 


actcaccqqt 


qqaqatqact 


gactctcagg 


acattaatat 


tagtgtggaa 


27540 


aaaataacta 


ctcagcgtca 


qcqaqaaacq 


cgagtgttgg 

3 3 13 ""33 


aatacctgga 


actacagcaa 


27600 


cttaaaaaat 


cccactaata 


tqaqaaaqqa 


qtqctqtgcc 

zj w 3 3 3 


atgttaagca 


ggcagccctt 


27660 


tcctacaata 


tcagcgttca 


gggacatgaa 


ctgtcttaca 


ctttgccttt 


gcagaaacaa 


27720 


accttctgca 


ccatqatqqq 


ctctacctcc 


atcacaatca 


cccaacaagc 


cgggcctgta 


27780 


aaaaaaacta 


tcctctgtca 


ctgtcacgca 


cctgattgca 


tgtccaaact 


aatcaaaact 


27840 


ctctgtgctt 


taggtgatat 


ttttaaggtg 


taaatcaata 


ataaacttac 


cttaaatttg 


27900 


acaacaaatt 


tctaataaca 


tcattcagca 


gcaccacttt 


accctcttcc 


cagctctcgt 


27960 


atqqqatacq 


ataqtqqqtq 


gcaaacttcc 


tccaaaccct 


aaaagaaata 


ttggtatcca 


28020 


cttccttgtc 


ctcacccaca 


attttcatct 


tttcatagat 


gaaaagaacc 


agagttgatg 


28080 


aagacttcaa 


ccccgtctac 


ccctatgaca 


ccacaaccac 


tcctgcagtt 


ccctttatat 


28140 


cacccccctt 


tgtaaacagc 


gatggtcttc 


aggaaaaccc 


cccaggtgtt 


ttaagtctgc 


28200 


gaatagctaa 


acccctatat 


ttcgacatgg 


agagaaaact 


agccctttca 


cttggaagag 


28260 


ggttgacaat 


taccgccgcc 


ggacaattag 


aaagtacgca 


gagcgtacaa 


accaacccac 


28320 


cattaataat 


taccaacaac 


aacacactga 


ccctacgtca 


ttctcccccc 


ttaaacctaa 


28380 


ctgacaatag 


cttagtgcta 


ggctactcga 


gtcctctccg 


cgtcacagac 


aacaaactta 


28440 


catttaactt 


cacatcacca 


ctccgttatg 


aaaatgaaaa 


ccttactttt 


aactatacag 


28500 


agcctcttaa 


acttataaat 


aacagccttg 


ccattgacat 


caattcctca 


aaaggcctta 


28560 


QtaacatcQQ 

3 *-**3 ^-3 t-v -33 


aggctcacta 


gctgtaaacc 


tgagttcaga 


cttaaagttt 


gacagcaacg 


28620 


gatccatagc 


ttttggcata 


caaaccctgt 


ggaccgctcc 


gacctcgact 


ggcaactgca 


28680 


ccgtctacag 


cqaqqqcqat 


tccctactta 


gtctctgttt 


aaccaaatgc 


ggagctcacg 


28740 


tcttaggaag 


tgtaagttta 


accggtttaa 


caggaaccat 


aacccaaatg 


actgatattt 


28800 


ctgtcaccat 


tcaatttaca 


tttgacaaca 


atggtaagct 


actaagctct 


ccacttataa 


28860 


acaacgcctt 


tagtattcga 


cagaatgaca 


gtacggcctc 


aaaccctacc 


tacaacgccc 


28920 


tggcgtttat 


gcctaacagt 


accatatatg 


caagaggggg 


aggtggtgaa 


ccacgaaaca 


28980 


actactacgt 


ccaaacgtat 


cttaqqqqaa 


atgttcaaaa 


accaatcatt 


cttactgtaa 


29040 


cctacaactc 


agtcgccaca 


ggatattcct 


tatcttttaa 


gtggactgct 

3 w 33~~~ 3 


cttgcacgtg 


29100 


aaaagtttgc 


aaccccaaca 


acctcgtttt 


gctacattac 


agaacaataa 


aaccgtgtac 


29160 


cccaccgttt 


cgtttttttc 


agatgaaacg 


ggcgagagtt 
33 3 3 3 ™ • 


gatgaagact 


tcaacccagt 


29220 


gtacccttat 


gaccccccac 


atgctcctgt 


tatgcccttc 


attactccac 


cttttacctc 


29280 


ctcggatggg 

33~~ "Saa 


ttgcaggaaa 


aaccacttgg 


agtgttaagt 


ttaaactaca 


gagatcccat 


29340 


tactacgcaa 


aatgagtctc 


ttacaattaa 


actaggaaac 


ggcctcactc 


tagacaacca 


29400 


gggacaacta 


acatcaaccg 


ctggcgaagt 


agaaccrcca 


crcactaacg 




?Q4fin 


acttgcactg 


gtctatagcg 


atcctttagc 


agtaaagcgc 
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tataaatgac 


aaggacaaac 


tagccctgca 


aacagccgcg 


ccccttgtaa 


ctaacgctgg 


29640 
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aattttaaat 
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tcgctcttag 


30120 


tgctgctttg 


cctttacaat 


acactgatca 


gcttcgcctt 


aacgtgggcg 


ctgggctgcg 
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ttacaatcca 


gtgagtaaga 


aattggacgt 


gaaccccaat 


caaaacaagg 


gtttaacctg 


30240 


ggaaaatgac 


tacctcattg 


taaagctagg 


aaatggatta 


ggttttgatg 
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30480 
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ttttgacaat 
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tgcactagca 
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gaactgtttc 
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tttctcttac 
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ttcaggctct 
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atctcactga 


gattccggca 


agaatcgcac 


accatcacaa 


tcaggttgtt 


catgatccca 


31380 


tagctgaaca 


cgctccagcc 


aaagctcatt 


cgctccaaca 


gcgccaccgc 


gtgtccgtcc 
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aaccttactt 


taacataaat 


caggtgtctg 


ccgcgtacaa 


acatgctacc 


cgcatacaga 


31500 


acctcccggg 


gcaaacccct 


gttcaccacc 


tgcctgtacc 


agggaaacct 


cacatttatc 


31560 


agggagccat 


agatagccat 


tttaaaccaa 


ttagctaaca 


ccgccccacc 


agctctacac 


31620 
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cctccccgct cattatcata ttggcacgtt tccagaataa ggtatattat tgatgcagca 33780 

aaacaatccc tctgggagca atcacaaaat cctccggtga aaaaagcaca tacatattag 33840 

aataaccctg ttgctggggc aaaaaggccc gtcgtcccag caaatgcaca taaatatgtt 33900 

catcagccat tgccccgtct taccgcgtaa acagccacga aaaaatcgag ctaaaatcca 33960 

cccaacagcc tatagctata tatacactcc acccaatgac gctaataccg caccacccac 34020 

gaccaaagtt cacccacacc cacaaaaccc gcgaaaatcc agcgccgtca gcacttccgc 34080 

aatttcagtc tcacaacgtc acttccgcgc gccttttcac tttcccacac acgcccttcg 34140 

cccgcccgcc ctcgcgccac cccgcgtcac cccacgtcac cgcacgtcac cccggccccg 34200 

cctcgctcct ccccgctcat tatcatattg gcacgtttcc agaataaggt atattattga 34260 

tgca 34264 

<210> 5 
<211> 31044 
<212> DNA 

<213> simian adenovirus SV-25 
<400> 5 

catcatcaat aatatacctt attctggaaa cgtgccaata tgataatgag cggggaggag 60 

cgaggcgggg ccggggtgac gtgcggtgac gcggggtggc gcgagggcgg ggcgaagggc 120 

gcgggtgtgt gtgtgggagg cgcttagttt ttacgtatgc ggaaggaggt tttataccgg 180 

aagatgggta atttgggcgt atacttgtaa gttttgtgta atttggcgcg aaaactgggt 240 

aatgaggaag ttgaggttaa tatgtacttt ttatgactgg gcggaatttc tgctgatcag 300 

cagtgaactt tgggcgctga cggggaggtt tcgctacgtg acagtaccac gagaaggctc 360 

aaaggtccca tttattgtac tcttcagcgt tttcgctggg tatttaaacg ctgtcagatc 420 

atcaagaggc cactcttgag tgctggcgag aagagttttc tcctccgtgc tgccacgatg 480 

aggctggtcc ccgagatgta cggtgttttt agcgacgaga cggtgcgtaa ctcagatgac 540 

ctgctgaatt cagacgcgct ggaaatttcc aattcgcctg tgctttcgcc gccgtcactt 600 

cacgacctgt ttgtgttttg gctcaacgct tagcaacgtg ttatataggg tcaagaagga 660 

gcaggagacg cagtttgcta ggctgttggc cgatactcct ggagtttttg tggctctgga 720 

tctaggccat cactctcttt tccaagagaa aattatcaaa aacttaactt ttacgtctcc 780 

tggtcgcacg gttgcttccg ctgcctttat tacctatatt ttggatcaat ggagcaacag 840 

cgacagccac ctgtcgtggg agtacatgct ggattacatg tcgatggcgc tgtggagggc 900 

catgctgcgg aggagggttt gcatttactt gcgggcgcag cctccgcggc tggaccgagt 960 

ggaggaggag gacgagccgg gggagaccga gaacctgagg gccgggctgg accctccaac 1020 

ggaggactag gtgctgagga tgatcccgaa gaggggacta gtggggctag gaagaagcaa 1080 

aagactgagt ctgaacctcg aaactttttg aatgagttga ctgtgagttt gatgaatcgt 1140 

cagcgtccgg agacaatttt ctggtctgaa ttggaggagg aattcaggag gggggaactg 1200 

aacctgctat acaagtatgg gtttgaacag ttaaaaactc actggttgga gccgtgggag 1260 
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gattttgaaa ccgccttgga cacttttgct aaagtggctc tgcggccgga taaggtttac 1320 

actatccgcc gcactgttaa cataaagaag agtgtttatg ttataggcca tggagctctg 1380 

gtgcaggtgc aaaccgtcga ccgggtggcc tttagttgcg gtatgcaaaa tctgggcccc 1440 

ggggtgatag gcttaaatgg tgtaacattt cacaatgtaa ggtttactgg tgaaagtttt 1500 

aacggctctg tgtttgcaaa taacacacag ctgacgctcc acggcgttta cttttttaac 1560 

tttaataaca catgtgtgga gtcgtggggc agggtgtctt tgaggggctg ctgttttcac 1620 

ggctgctgga aggcggtggt gggaagactt aaaagtgtaa catctgtaaa aaaatgcgtg 1680 

tttgagcggt gtgtgttggc tttaactgtg gagggctgtg gacgcattag gaataatgcg 1740 

gcgtctgaga atggatgttt tcttttgcta aaaggcacgg ctagtattaa gcataacatg 1800 

atatgcggca gcggtctgta cccttcacag ctgttaactt gcgcggatgg aaactgtcag 1860 

accttgcgca ccgtgcacat agcgtcccac cagcgccgcg cctggccaac attcgagcac 1920 

aatatgctta tgcgttgtgc cgtccacttg ggccctaggc gaggcgtgtt tgtgccttac 1980 

cagtgtaact ttagccatac caagatttta ctagaacctg ataccttctc tcgagtgtgt 2040 

ttcaatgggg tgtttgacat gtcaatggaa ctgtttaaag tgataagata tgatgaatcc 2100 

aagtctcgtt gtcgcccatg tgaatgcgga gctaatcatc tgaggttgta tcctgtaacc 2160 

ctaaacgtta ccgaggagct gaggacggat caccacatgt tgtcctgcct gcgcaccgac 2220 

tatgaatcca gcgacgagga gtgaggtgag gggcggagcc acaaagggta taaaggggcg 2280 

tgaggggtgg gtgtgatgat tcaaaatgag cgggacgacg gacggcaacg cgtttgaggg 2340 

tggagtgttc agcccttatc tgacatctcg tcttccttcc tgggcaggag tgcgtcagaa 2400 

tgtagtgggc tccaccgtgg acggacgacc ggtcgcccct gcaaattccg ccaccctcac 2460 

ctatgccacc gtgggatcat cgttggacac tgccgcggca gctgccgctt ctgctgccgc 2520 

ttctactgct cgcggcatgg cggctgattt tggactgtat aaccaactgg ccactgcagc 2580 

tgtggcgtct cggtctctgg ttcaagaaga tgccctgaat gtgatcctga ctcgcctgga 2640 

gatcatgtca cgtcgcttgg acgaactggc tgcgcagata tcccaagcta accccgatac 2700 

cacttcagaa tcctaaaata aagacaaaca aatatgttga aaagtaaaat ggctttattt 2760 

gttttttttg gctcggtagg ctcgggtcca cctgtctcgg tcgttaagaa ctttgtgtat 2820 

gttttccaaa acacggtaca gatgggcttg gatgttcaag tacatgggca tgaggccatc 2880 

tttggggtga agataggacc attgaagagc gtcatgctcc ggggtggtgt tgtaaattac 2940 

ccagtcgtag cagggtttct gggcgtggaa ctggaagatg tcctttagga gtaggctgat 3000 

ggccaagggc aggcccttag tgtaggtgtt tacaaagcgg ttaagctggg agggatgcat 3060 

gcggggggag atgatatgca tcttggcttg gatcttgagg ttagctatgt taccacccag 3120 

gtctctgcgg gggttcatgt tatgaaggac caccagcacg gtgtagccgg tgcatttggg 3180 

gaacttgtca tgcagtttgg aggggaaggc gtggaagaat ttagagaccc ccttgtggcc 3240 

ccctaggttt tccatgcact catccataat gatggcaatg ggacccctgg cggccgcttt 3300 
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ggcaaacacg ttttgggggt tggaaacatc atagttttgc tctagagtga gctcatcata 3360 

ggccatctta acaaagcggg gtaggagggt gcccgactgg gggatgatag ttccatctgg 3420 

gcctggggcg tagttaccct cacagatctg catctcccag gccttaattt ccgagggggg 3480 

tatcatgtcc acctgggggg caataaagaa cacggtttct ggcgggggat tgatgagctg 3540 

ggtggaaagc aagttacgca gcagttgaga tttgccacag ccggtggggc cgtagatgac 3600 

cccgatgacg ggttgcagct ggtagttgag agaggaacag ctgccgtcgg ggcgcaggag 3660 

gggggctacc tcattcatca tgcttctaac atgtttattt tcactcacta agttttgcaa 3720 

gagcctctcc ccacccaggg ataagagttc ttccaggctg ttgaagtgtt tcagcggttt 3780 

taggccgtcg gccatgggca tcttttcgag cgactgacga agcaagtaca gtcggtccca 3840 

gagctcggtg acgtgctcta tggaatctcg atccagcaga cttcttggtt gcgggggttg 3900 

ggtcgacttt cgctgtaggg caccagccgg tgggcgtcca gggccgcgag ggttctgtcc 3960 

ttccagggtc tcagcgtccg ggtgagggtg gtctcggtga cggtgaaggg atgagccccg 4020 

ggctgggcgc ttgcgagggt gcgcttcagg ctcatcctgc tggtgctgaa gcggacgtcg 4080 

tctccctgtg agtcggccag atagcaacga agcatgaggt cgtagctgag ggactcggcc 4140 

gcgtgtccct tggcgcgcag ctttcccttg gaaacgtgct gacatttggt gcagtgcaga 4200 

cattggaggg cgtagagttt gggggccagg aagaccgact cgggcgagta ggcgtcggct 4260 

ccgcactgag cgcagacggt ctcgcactcc actagccacg tgagctcggg tttagcggga 4320 

tcaaaaacca agttgcctcc attttttttg atgcgtttct taccttgcgt ttccatgagt 4380 

ttgtggcccg cttccgtgac aaaaaggctg tcggtgtctc cgtagacaga cttgaggggg 4440 

cgatcttcca aaggtgttcc gaggtcttcc gcgtacagga actgggacca ctccgagacg 4500 

aaggctctgg tccaggctaa cacgaaggag gcaatctgcg aggggtatct gtcgttttca 4560 

atgagggggt ccaccttttc cagggtgtgc agacacaggt cgtcctcctc cgcgtccacg 4620 

aaggtgattg gcttgtaagt gtaggtcacg tgatctgcac cccccaaagg ggtataaaag 4680 

ggggcgtgcc caccctctcc gtcactttct tccgcatcgc tgtggaccag agccagctgt 4740 

tcgggtgagt aggccctctc aaaagccggc atgatctcgg cgctcaagtt gtcagtttct 4800 

acaaacgagg tggatttgat attcacgtgc cccgcggcga tgcttttgat ggtggagggg 4860 

tccatctgat cagaaaacac gatctttttg ttgtcaagtt tggtggcgaa agacccgtag 4920 

agggcgttgg aaagcaactt ggcgatggag cgcagggtct gatttttctc ccgatcggcc 4980 

ctctccttgg cggcgatgtt gagttgcacg tactcccggg ccgcgcaccg ccactcgggg 5040 

aacacggcgg tgcgctcgtc gggcaggatg cgcacgcgcc agccgcgatt gtgcagggtg 5100 

atgaggtcca cgctggtagc cacctccccg cggaggggct cgttggtcca acacaatcgc 5160 

cccccttttc tggagcagaa cggaggcagg ggatctagca agttggcggg cggggggtcg 5220 

gcgtcgatgg tgaagatacc gggtagcagg atcttattaa aataatcgat ttcggtgtcc 5280 

gtgtcttgca acgcgtcttc ccacttcttc accgccaggg ccctttcgta gggattcagg 5340 

ggcggtcccc agggcatggg gtgggtcagg gccgaggcgt acatgccgca gatgtcatac 5400 

Page 73 



WO 2005/001103 



PCT/US2004/016614 



acgtacaggg 


gttccctcaa 


caccccgatg 


taagtggggt 


aacagcgccc 


cccgcggatg 


5460 


ctggctcgca 


cgtagtcgta 


catctcgcgc 


gagggagcca 


tgaggccgtc 


tcccaagtgg 


5520 


gtcttgtggg 


gtttttcggc 


ccggtagagg 


atctgtctga 


agatggcgtg 


ggagttggaa 


5580 


gagatggtgg 


ggcgttggaa 


gacgttaaag 


ttggccccgg 


gtagtcccac 


ggagtcttgg 


5640 


atgaactggg 


cgtaggattc 


ccggagtttg 


tccaccaggg 


cggcggtcac 


cagcacgtcg 


5700 


agagcgcagt 


agtccaacgt 


ctcgcggacc 


aggttgtagg 


ccgtctcttg 


ttttttctcc 


5760 


cacagttcgc 


ggttgaggag 


gtattcctcg 


cggtctttcc 


agtactcttc 


ggcgggaaat 


5820 


cctttttcgt 


ccgctcggta 


agaacctaac 


atgtaaaatt 


cgttcaccgc 


tttgtatgga 


5880 


caacagcctt 


tttctaccgg 


cagggcgtac 


gcttgagcgg 


cctttctgag 


agaggtgtgg 


5940 


gtgagggcga 


aggtgtcccg 


caccatcact 


ttcaggtact 


gatgtttgaa 


gtccgtgtcg 


6000 


tcgcaggcgc 


cctgttccca 


cagcgtgaag 


tcggtgcgct 


ttttctgcct 


gggattgggg 


6060 


agggcgaagg 


tgacatcgtt 


aaagagtatt 


ttcccggcgc 


ggggcatgaa 


gttgcgagag 


6120 


atcctgaagg 


gcccgggcac 


gtccgagcgg 


ttgttgatga 


cctgcgccgc 


caggacgatc 


6180 


tcgtcgaagc 


cgttgatgtt 


gtgacccacg 


atgtaaagtt 


cgatgaagcg 


cggctgtccc 


6240 


ttgagggccg 


gcgctttttt 


caactcctcg 


taggtgagac 


agtccggcga 


ggagagaccc 


6300 


agctcagccc 


gggcccagtc 


ggagagttga 


ggattagccg 


caaggaagga 


gctccataga 


6360 


tccaaggcca 


ggagagtttg 


caagcggtcg 


cggaactcgc 


ggaacttttt 


ccccacggcc 


6420 


attttctccg 


gtgtcactac 


gtaaaaggtg 


ttggggcggt 


tgttccacac 


gtcccatcgg 


6480 


agctctaggg 


ccagctcgca 


ggcttggcga 


acgagggtct 


cctcgccaga 


gacgtgcatg 


6540 


accagcataa 


agggtaccaa 


ctgtttcccg 


aacgagccca 


tccatgtgta 


ggtttctacg 


6600 


tcgtaggtga 


caaagagccg 


ctgggtgcgc 


gcgtgggagc 


cgatcggaaa 


gaagctgatc 


6660 


tcctgccacc 


agctggagga 


atgggtgtta 


atgtggtgga 


agtagaagtc 


ccgccggcgc 


6720 


acagagcatt 


cgtgctgatg 


tttgtaaaag 


cgaccgcagt 


agtcgcagcg 


ctgcacgctc 


6780 


tgtatctcct 


gaacgagatg 


cgcttttcgc 


ccgcgcacca 


gaaaccggag 


ggggaagttg 


6840 


agacgggggg 


ctggtggggc 


gacatcccct 


tcgccttggc 


ggtgggagtc 


tgcgtctgcg 


6900 


tcctccttct 


ctgggtggac 


gacggtgggg 


acgacgacgc 


cccgggtgcc 


gcaagtccag 


6960 


atctccgcca 


cggaggggtg 


caggcgctgc 


aggaggggac 


gcagctgccc 


gctgtccagg 


7020 


gagtcgaggg 


aagtcgcgct 


gaggtcggcg 


ggaagcgttt 


gcaagttcac 


tttcagaaga 


7080 


ccggtaagag 


cgtgagccag 


gtgcagatgg 


tacttgattt 


ccaggggggt 


gttggatgaa 


7140 


gcgtccacgg 


cgtagaggag 


tccgtgtccg 


cgcggggcca 


ccaccgtgcc 


ccgaggaggt 


7200 


tttatctcac 


tcgtcgaggg 


cgagcgccgg 


ggggtagagg 


cggctctgcg 


ccggggggca 


7260 


gcggaggcag 


aggcacgttt 


tcgtgaggat 


tcggcagcgg 


ttgatgacga 


gcccggagac 


7320 


tgctggcgtg 


ggcgacgacg 


cggcggttga 


ggtcctggat 


gtgccgtctc 


tgcgtgaaga 


7380 


ccaccggccc 


ccgggtcctg 


aacctaaaga 


gagttccaca 


gaatcaatgt 


ctgcatcgtt 


7440 
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ZJ ZJ ZJ ZJ ZJ ZJ 
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gtagccgacc 


aggaagtgag 


gaggcggctc 
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cggtggcggg 
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ggcgccgggg 


gacaggtcgt 
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tggtagatgt 
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tggttgcggc 
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gctccatggt 


gggcacgctc 
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ccggtgctgc 
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ygcagaxgcg 


dCCCCdydt_y 


a rtarra 
LtLaLlatLd 


ccgccg tggc 


nnrantaaar 
yyLay Lddav. 


\- Lydy uyydy 




gcggtgacag 


ggaggaggaa 


gagctggctt 


tagacctgga 


agagggagag 


gggctggccc 


9480 


ggctgggagc 


gccatcccca 


gagagacacc 


ctagggttca 


gctcgtgagg 


gacgccaggc 


9540 
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aggcttttgt 
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cggtcaggag 
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tgcgcgattg 
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tccaaaagag 


ctttaacaat 


cacgtgagga 


ccctgatcgc 
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gtgaccatcg 


9840 


ggctgatgca 


tctgtgggac 
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cctacgtgca 


gaacccggct 
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9900 


tgacggccca 


gctgttcctg 


atcgtgcagc 


acagccgcga 


caacgagacg 


ttccgcgacg 
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ccatgttgaa 


catcgcggag 


cccgagggtc 


gctggctctt 


ggatctgatt 


aacatcctgc 


10020 


agagcatcgt 


ggtgcaggag 


aggggcctga 


gtttagcgga 


caaggtggcg 


gccattaact 


10080 


attcgatgca 


gagcctgggg 


aagttctacg 


ctcgcaagat 


ctacaagagc 


ccttacgtgc 


10140 


ccatagacaa 


ggaggtgaag 


atagacagct 


tttacatgcg 


catggcgctg 


aaggtgctga 


10200 


cgctgagcga 


cgacctcggc 


gtgtaccgta 


acgacaagat 


ccacaaggcg 


gtgagcgcca 


10260 


gccgccggcg 


ggagctgagc 


gacagggagc 


tgatgcacag 


cctgcagagg 


gcgctggcgg 


10320 


gcgccgggga 


cgaggagcgc 


gaggcttact 


tcgacatggg 


agccgatctg 


cagtggcgtc 


10380 


ccagcgcgcg 


cgccttggag 


gcggcgggtt 


atcccgacga 


ggaggatcgg 


gacgatttgg 


10440 


aggaggcagg 


cgagtacgag 


gacgaagcct 


gaccgggcag 


gtgttgtttt 


agatgcagcg 


10500 


gccggcggac 


gggaccaccg 


cggatcccgc 


acttttggca 


tccatgcaga 


gtcaaccttc 


10560 


gggcgtgacc 


gcctccgatg 


actgggcggc 


ggccatggac 


cgcatcatgg 


cgctgaccac 


10620 


ccgcaacccc 


gaggctttta 


ggcagcaacc 


ccaggccaac 


cgtttttcgg 


ccatcttgga 


10680 


agcggtggtg 


ccgtcgcgca 


ccaacccgac 


gcacgagaaa 


gtcctgacta 


tcgtgaacgc 


10740 


cctggtagac 


agcaaggcca 


tccgccgtga 


cgaggcgggc 


ttgatttaca 


acgctctttt 


10800 


ggaacgcgtg 


gcgcgctaca 


acagcactaa 


cgtgcagacc 


aatctggacc 


gcctcaccac 


10860 


cgacgtgaag 


gaggcgctgg 


cgcagaagga 


gcggtttctg 


agggacagta 


atctgggctc 


10920 


tctggtggca 


ctgaacgcct 


tcctgagctc 


acagccggcc 


aacgtgcccc 


gcgggcagga 


10980 


ggattacgtg 


agcttcatca 


gcgctctgag 


actgctggtg 


tccgaggtgc 


cccagagcga 


11040 


ggtgtaccag 


tctgggccgg 


attacttttt 


ccagacgtcc 


cgacagggct 


tgcaaacggt 


11100 


gaacctgact 


caggccttta 


aaaacttgca 


aggcatgtgg 


ggggtcaagg 


ccccggtggg 


11160 


cgatcgcgcc 


actatctcca 


gtctgctgac 


ccccaacact 


cgcctgctgc 


tgctcttgat 


11220 


cgcaccgttt 


accaacagta 


gcactatcag 


ccgtgactcg 


tacctgggtc 


atctcatcac 


11280 


tctgtaccgc 


gaggccatcg 


gccaggctca 


gatcgacgag 


catacgtatc 


aggagattac 


11340 


taacgtgagc 


cgtgccctgg 


gtcaggaaga 


taccggcagc 


ctggaagcca 


cgttgaactt 


11400 


tttgctaacc 


aaccggaggc 


aaaaaatacc 


ctcccagttc 


acgttaagcg 


ccgaggagga 


11460 


gaggattctg 


cgatacgtgc 


agcagtccgt 


gagcctgtac 


ttgatgcgcg 


agggcgccac 


11520 


cgcttccacg 


gctttagaca 


tgacggctcg 


gaacatggaa 


ccgtcctttt 


actccgccca 


11580 
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17400 


aCtCCQQQCQ 
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17520 
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taggttttag 


agataatttt 
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a 9 ttgtccta 
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tgttttcctc tgggaggaat ggtggtgact gacaattaca actctgtgac gcctcaaaat 17880 

ggaggcagtg gaaatacatg gcaggcagac aatactacat ttagtcaaag aggagcgcag 17940 

attggctccg gaaacatgtt tgccctggaa attaacctac aggccaacct ctggcgcggc 18000 

ttcttgtatt ccaatattgg gttgtatctt ccagactctc tgaaaatcac ccccgacaac 18060 

atcacgctgc cagaaaacaa aaacacttat cagtacatga acggtcgcgt aacgccaccc 18120 

gggctcatag acacctatgt aaacgtgggc gcgcgctggt cccccgatgt catggacagc 18180 

attaacccct tcaaccacca ccgtaacgcg ggcttgcgct accgctccat gctcttgggc 18240 

aacggccgtt atgtgccttt tcacattcag gtgccccaaa aattctttgc cattaaaaac 18300 

ctgctgcttc tccccggttc ctatacctat gagtggaact tccgcaagga tgtcaacatg 18360 

atcctgcaga gctcgctggg taatgacctg cgagtggacg gggccagcat acgctttgac 18420 

agcattaacc tgtatgccaa cttttttccc atggcccaca acacggcctc taccctggaa 18480 

gccatgctgc gcaacgacac caatgaccag tccttcaacg actacctgtg cgcggctaac 18540 

atgctgtacc ccatccccgc caacgccacc agcgtgccca tttctattcc ttctcggaac 18600 

tgggctgcct tcaggggctg gagttttact cgcctcaaaa ccaaggagac tccctcgctg 18660 

ggctccggtt ttgaccccta ctttgtttac tccggctcca ttccctacct agatggcacc 18720 

ttttacctca accacacttt caaaaaggtg tctattatgt ttgactcctc ggttagctgg 18780 

cccggcaacg accgcctgct aacgcccaac gagttcgaaa ttaagcgttc cgtggacggt 18840 

gaagggtaca acgtggccca gagcaacatg accaaggact ggtttctaat tcaaatgctc 18900 

agtcactata atataggtta ccagggcttc tatgtgcccg agaactacaa ggaccgcatg 18960 

tactccttct tccgcaactt ccaaccaatg agccggcagg tggtagatac cgtgacttat 19020 

acagactaca aagatgtcaa gctcccctac caacacaaca actcagggtt cgtgggctac 19080 

atgggaccca ccatgcgaga gggacaggcc tacccggcca actatcccta ccccctgatc 19140 

ggagagactg ccgtacccag cctcacgcag aaaaagttcc tctgcgaccg ggtgatgtgg 19200 

aggataccct tctctagcaa ctttatgtcg atgggctccc tcaccgacct ggggcagaac 19260 

atgctgtacg ccaactccgc tcacgccttg gacatgactt ttgaggtgga tcccatggat 19320 

gagcccacgc ttctctatgt tctgtttgaa gtcttcgacg tggtgcgcat ccaccagccg 19380 

caccgcggcg tcatcgaggc cgtctacctg cgcacacctt tctctgccgg taacgccacc 19440 

acctaaagaa gctgatgggt tccagcgaac aggagttgca ggccattgtt cgcgacctgg 19500 

gctgcgggcc ctgctttttg ggcaccttcg acaagcgttt tcccggattc atgtcccccc 19560 

acaagccggc ctgcgccatc gttaacacgg ccggacggga gacagggggg gtgcactggc 19620 

tcgccttcgc ctggaacccg cgcaaccgca cctgctacct gttcgaccct tttggtttct 19680 

ccgacgaaag gctgaagcag atctaccaat tcgagtacga ggggctcctc aagcgcagcg 19740 

ctctggcctc cacgcccgac cactgcgtca ccctggaaaa gtccacccag acggtccagg 19800 

ggcccctctc ggccgcctgc gggcttttct gttgcatgtt tttgcacgcc ttcgtgcact 19860 
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aaaaaccagc gcattcccct ttcctgccgg gccaaccgca ccgcggccga taggaagcta 22020 

acactcagaa acggagtcag catacctgat atcacgtcac tggaggaagt gcctaagatc 22080 

ttcgagggtc tgggtcgaga tgagaagcgg gcggcgaacg ctctgcagaa agaacagaaa 22140 

gagagtcaga acgtgctggt ggagctggag ggggacaacg cgcgtctgac cgtcctcaaa 22200 

cgttgcatag aagtttccca cttcgcctac ccggccctca acctgccgcc caaagttatg 22260 

aaatcggtca tggaccagct actcatcaag agagctgagc ccctgaatcc cgaccaccct 22320 

gaggcggaaa actcagagga cggaaagccc gtcgtcagcg acgaggagct cgagcggtgg 22380 

ctggaaacca gggaccccca gcagttgcaa gagaggcgca agatgatgat ggcggccgtg 22440 

ctggtcacgg tggagctaga atgcctgcaa cggtttttca gcgacgtgga gacgctacgc 22500 

aaaatcgggg agtccctgca ctacaccttc cgccagggct acgttcgcca ggcctgcaaa 22560 

atctccaacg tagagctcag caacctggtt tcctacatgg gcatcctcca cgagaaccgg 22620 

ctggggcaga gcgtgctgca ctgcaccttg caaggcgagg cgcgaaggga ctacgtccga 22680 

gactgcgtct acctcttcct caccctcacc tggcagaccg ccatgggcgt gtggcagcag 22740 

tgcttggaag agagaaacct caaagagctg gacaaactcc tctgccgcca gcggcgggcc 22800 

ctctggaccg gcttcagcga gcgcacggtc gcctgcgccc tggcagacat cattttccca 22860 

gaacgcctga tgaaaacctt gcagaacggc ctgccggatt tcatcagtca gagcatcttg 22920 

caaaacttcc gctccttcgt cctggagcgc tccgggatct tgcccgccat gagctgcgcg 22980 

ctgccttctg actttgtccc cctttcctac cgcgagtgcc ctcccccact gtggagccac 23040 

tgctacctct tccaactggc caactttctg gcctaccact ccgacctcat ggaagacgtg 23100 

agcggagagg ggctgctcga gtgccactgc cgctgcaacc tctgcacccc ccacagatcg 23160 

ctggcctgca acaccgagct gctcagcgaa acccaggtca taggtacctt cgagatccag 23220 

gggccccagc agcaagaggg tgcttccggc ttgaagctca ctccggcgct gtggacctcg 23280 

gcttacttac gcaaatttgt agccgaggac taccacgccc acaaaattca gttttacgaa 23340 

gaccaatctc gaccaccgaa agcccccctc acggcctgcg tcatcaccca gagcaaaatc 23400 

ctggcccaat tgcaatccat caaccaagcg cgccgagatt tccttttgaa aaagggtcgg 23460 

ggggtgtacc tggaccccca gaccggcgag gaactcaacc cgtccacact ttccgtcgaa 23520 

gcagcccccc cgagacatgc cacccaaggg aaccgccaag cagctgatcg ctcggcagag 23580 

agcgaagaag caagagctgc tccagcagca ggtggaggac gaggaagagc tgtgggacag 23640 

ccaggcagag gaggtgtcag aggacgagga ggagatggaa agctgggaca gcctagacga 23700 

ggaggacgag ctttcagagg aagaggcgac cgaagaaaaa ccacctgcat ccagcgcgcc 23760 

ttctctgagc cgacagccga agccccggcc cccgacgccc ccggccggct cactcaaagc 23820 

cagccgtagg tgggacgcca ccggatctcc agcggcagcg gcaacggcag cgggtaaggc 23880 

caaacgcgag cggcgggggt attgctcctg gcggacccac aaaagcagta tcgtgaactg 23940 

cttgcaacac tgcgggggaa acatctcctt tgcccgacgc tacctcctct tccatcacgg 24000 
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"totooccttic 


cctcacaaca 

v_ v. L.v_y v.u ci V— y 


ttctctatta 

*• W V» *- V, *.** w *• *A 


ttaccatcat 

W W LA V# \a W^**A W 


ctctacagcc 


cctacgaaac 


24060 


actcaaaaaa 

y i. v. y y v*. y ** i* 


aaaaactaaa 


acctcctcto 


ccqcaaqaaa 


aaactccgcc 


gccgctgccg 

3 3 ^3 3 


24120 


ccaaaoat:cc 

V- *A UM y ** w V. V. 


accaaccacc 

%j V* V» VJ VJ X* W* LA W W 


aaaaaactaa 

y«yyayv.*yu 


gaaagegcat 


ctttcccact 


ctgtatgcta 


24180 


■tctttcaaca 

v. v. w w \. ** 


aaaccacaaa 


caacaccctc 


aacacaaact 

u y 3 U **** w 


gaaaataaaa 


aaccgctcct 


24240 


"tccartcact 


cacccacaac 

v.** v> w> w,y w*t*y v 


tatctatacc 


acaaaaaaaa 


aqaccaqctq 


cagcgcaccc 


24300 


taaacaacac 


caaaacacta 

v. ******** v.**w> 3 


ttcaacaaat 


actactcaac 


gtctcttaaa 


gactaaaaga 


24360 


LWVljCljL w w w 


■ttccccctcti 

l— l_ V— V. V. V. V. WV»y 


aacaccaaaa 


cccacatcat 


cgccagcatg 


agcaaggaga 


24420 


ttiCCC^CCCC 

L. V_ \_ V_ \— Ci. v_ v_ v_ v_ 


ttacatataa 

*_ w**v»%* v- y ty y 


aactatcaac 


cccaaataQQ 


cctaaccqca 

L yy*-*-3 v 3 


qqqqccqccc 

3333 V " V ■'3 V ■" X ■■ V ■" 


24480 


aaaactactc 


caacaaaata 


aactaactca 


acaccqqccc 

yv.y^-*-yyv.wx. 


ccacatgatc 


tcacgagtta 


24540 




aacccaccaa 

**y v» v» v. v* w* w> y ** 


aaccaaatcc 


tcttagaaca 


qacqqeaate 

yyv.yyv.uu.uv. 


accgccacac 


24600 


cccaacacca 


artcaaccca 

**v» w *j.****w* v»v*y 


cccaattaac 

*j* %■%«** y *• *»y*j^ 


ccqccqccca 


aqtqtatcaq 

33 *" 3 ^***-v.**y 


gaaactcccc 


24660 


QCCCQ3.CC&C 


aatcctccta 

**y *> v. *— w *■» v. wy 


ccacacaaca 

v* v»** y *_> y ** y 


caaaaaccaa 


agtcctcatg 


actaactctg 


24720 


aaatacaatt: 

y y y •» ** **** w *• 


2LQCQQQCQ0Q 

^y-yyy^yyy 


tccaaataca 

*>v.*»**yy i>uv>y 


ccaaqtacaq 


aaqtcqqqcc 


gctccttact 


24780 


ctcccaaaaa 


tataaaaaaa 

*>** uuuuyuyy 


ataatcattc 


aaaaccaaaa 

y u yy v -^-y u yy 


tatccagctc 


aacqacqaqq 

3 3 3 3 


24840 


caataaactc 


ctcaaccaat 


ctcaaaccta 


acaaaqtctt 

**wyy**y *-v« v» *. 


ccaactcqqa 


qqaqcqqqcc 

yy**y *-yyy v-w 


24900 


actcttcctt 

y v. w W* w w v« v. w *» 


caccactcac 

V> ** V. V» ** Wj l>*>M *> 


caaacctacc 


tqaccctqca 


aaactcttcc 


tcqcaqccqc 

w> w^^^ w. w w 


24960 


actccaaaaa 


aatcaacact 

LA vA ^^MH v*^^ w 


ctccaattca 


tggaagagtt 


cgtcccctcc 


gtctacttca 


25020 


acccattttc 

**V«V»V»y WWW V> V» 


coactcacct 

*>y y *— wv»**w»v» w 


aaacactacc 


cggacgcctt 


cattcccaac 


tttaacacaa 

w w t>y M\«y *>u«^ 


25080 


"taaataaatc 


*-y wyyuwyyw 


tacaactaat 

LUWMUV L- W LA L- 


aacaaataat 

y**v«**y** wyy *. 


acqqccqtqa 

3 33 3 3 


gagctegget 


25140 


acaacatrta 

y w>y**v»** v. v. wy 


catcactacc 

v* ** w v.** \— *-y v» ^» 


accaacctca 

y v* *» ** y v *■ *.^y 


ctactacact 


cqqqaqqcqa 
^yyy^yy^y^ 


tcatattcaq 

WW^^4 ^ y| W W* W 


25200 


ctactttaaa 

W» W** W W W bW I* y 


ctaccaaaca 


aacaccctca 


aaaaccaact 


cacaqqttqa 


aactcgagat 


25260 


"taaaaacara 


rt t a aat* etc 

*— *. l. y c*y *. V» *. *j» 


acctcatcaa 

***— *— *.*—** v. *-y ** 


CQCCtt.CRCC 


acccaacctc 

yvwyyvw *>v. 


tcctaataqa 

ww>^> ^*33 ****y ** 


25320 


aaccaaarnr 


aaaa"tcac1:a 
y y y *-*—***— *>** 


*»>>•** *.V»**V»*vV> w 


crt t: eta cat c 

y w *» v *> y w** *• 


taccccacac 

W Vj W W» W>. Liw Lj Wj* 


ccaqattaca 

W WMV|L4 L> UL4WIA 


25380 


taaaaatrta 

uuu ** w w> w y 


■tat"ta*tcatc 

*>y *• ***y *-•** *- *— 


tttacactca 

W W Lj*%^ ^*L4 


atttaataaa 

*J W W W LA LA L» LA LA LA 


aactaaactt 

LA LA W^ LHUIAW %m W 


tttaccatac 


25440 


cttcaacacc 


acacattatt 

**v«y*»y *• *>y *• 


tctccttata 

Lk K« L* L^ LJ W \4 


aaaaaacccc 

LA LA LA LA LA LA V* V* X* 


aaaaatcctt 

LA W| WJ LA W| WWW W L« 


aacttacaca 


25500 


taacaaaacc 

»>»*y V*********^ V. 


cttatatttt 

v» *. *p y *_** *>*•*>*• 


accataaaaa 


aacaactaac 

LA LA V* LA LA V* WV%ML« 


cctttcaatt 

W* W* W> Lb LWlALi W> W> 


aaaaaaaaat 

VJ L^ LA LA LA LA VJ VJ LJ W 


25560 


taacaatttc 


tactacaaaa 


caattaaaaa 

\#ViW L. L» W| LA LA LA 


acacaqcaaa 

*J ^^LA WUM L*vAL*Lj 


catacaaaac 

W* WJ W> LA WLA WJ WJ IA W 


aacactacac 

**L4 WM WLiWLiW 


25620 


cacccctaca 

^* W*» W* V* W* W* ^- LA W»* WJ 


taatatttcc 


cctttaaagc 


tgacagacaa 


eggtttaaca 


ttaagctatt 


25680 


caaatcccct 

^™ 7J LA Wj W»» V* \a Wa L* 


acotataata 

y *->y *-y wyy *.** 


ggtgaccaac 


ttacgtttaa 


ttttacttct 

W. W> W W> LA W* W> W W W» 


ccactacgtt 


25740 


acqaaaataa 

**»^»jj| ******** WHM 


caatcttaca 


ttcaactaca 


cttctcccat 

L* W Vp* %# W WW WL4 W 


qacactaata 


aacaacagtc 


25800 


ttactattaa 

W W*JW« W** *» W***A 


catcaatacc 

v>y *. w**** *>**v*w 


tccaaaaacc 

W V* LA LA LA \f %A 


tcaqtaataa 

LWUM W LA WJ ^ Wj LA 


caacaacaca 

LtfLiU w^ WJ wj VL4 WM 


ctcactataa 

V* WWM W W WLALA 


25860 


atattactcc 

***J *» W** Wj w V>V> 


aaattttaaa 

**y** *»*•*■ *»**y ** 


tttaacaact 


ctaatacctt 

w*.yy byViV *- *■ 


aacttttaac 

****v. *•*•** wyy v. 


atacaaaatc 

LA VLiWMVilAM Wj W 


25920 


tatggacttt 


tccaaccaaa 


actcctaact 


gtaccgtgtt 


taccgaaagt 


gactccctgc 


25980 


tgagtctttg 


cttgactaaa 


tgeggagetc 


aegtacttgg 


aagcgtgagt 


ttaageggag 


26040 


tggcaggaac 


catgetaaaa 


atgacccaca 


cttctgttac 


cgttcagttt 


tcgtttgatg 


26100 
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acagtggtaa 


actaatattc 


tctccacttg 


cgaacaacac 


ttggggtgtt 


cgacaaagcg 


26160 


agagtccgtt 


gcccaaccca 


tccttcaacg 


ctctcacgtt 


tatgccaaac 


agtaccattt 


26220 


attctagagg 


agcaagtaac 


gaacctcaaa 


acaattatta 


tgtccagacg 


tatcttagag 


26280 


gcaacgtgcg 


aaagccaatt 


ctactaactg 


ttacctacaa 


ctcagttaat 


tcaggatatt 


26340 


ccttaacttt 


taaatgggat 


gctgtcgcca 


atgaaaaatt 


tgccactcct 


acatcttcgt 


26400 


tttgctatgt 


tgcagagcaa 


taaaaccctg 


ttaccccacc 


gtctcgtttt 


tttcagatga 


26460 


aacgagcgag 


agttgatgaa 


gacttcaacc 


cagtgtaccc 


ttatgacccc 


ccatacgctc 


26520 


ccgtcatgcc 


cttcattact 


ccgcctttta 


cctcctcgga 


tgggttgcag 


gaaaaaccac 


26580 


ttggagtgtt 


aagtttaaac 


tacagggatc 


ccattactac 


acaaaatggg 


tctctcacgt 


26640 


taaaactagg 


aaacggcctc 


actctaaaca 


accagggaca 


gttaacatca 


actgctggcg 


26700 


aagtggagcc 


tccgctcact 


aatgctaaca 


acaaacttgc 


actagcctat 


agcgaaccat 


26760 


tagcagtaaa 


aagcaaccgc 


ctaactctat 


cacacaccgc 


tccccttgtc 


atcgctaata 


26820 


attctttagc 


gttgcaagtt 


tcagagccta 


tttttgtaaa 


tgacgatgac 


aagctagccc 


26880 


tgcagacagc 


cgcccccctt 


gtaaccaacg 


ctggcaccct 


tcgcttacag 


agcgctgccc 


26940 


ctttaggatt 


ggttgaaaat 


actcttaaac 


tgctgttttc 


taaacccttg 


tatttgcaaa 


27000 


atgattttct 


tgcattagcc 


attgaacgcc 


ccctggctgt 


agcagccgca 


ggtactctga 


27060 


ccctacaact 


tactcctcca 


ttaaagacta 


acgatgacgg 


gctaacacta 


tccacagtcg 


27120 


agccattaac 


tgtaaaaaac 


ggaaacctag 


gcttgcaaat 


atcgcgccct 


ttagttgttc 


27180 


aaaacaacgg 


cctttcgctt 


gctattaccc 


ccccgctgcg 


tttgtttaac 


agcgaccccg 


27240 


ttcttggttt 


gggcttcact 


tttcccctag 


ctgtcacaaa 


caacctcctc 


tccttaaaca 


27300 


tgggagacgg 


agttaaactt 


acctataata 


aactaacagc 


caatttgggt 


agggatttac 


27360 


aatttgaaaa 


cggtgcgatt 


gccgtaacgc 


ttactgccga 


attacctttg 


caatacacta 


27420 


acaaacttca 


actgaatatt 


ggagctggcc 


ttcgttacaa 


tggagccagc 


agaaaactag 


27480 


atgtaaacat 


taaccaaaat 


aaaggcttaa 


cttgggacaa 


cgatgcagtt 


attcccaaac 


27540 


taggatcggg 


cttacaattt 


gaccctaatg 


gcaacatcgc 


tgttatccct 


gaaaccgtga 


27600 


agccgcaaac 


gttatggacg 


actgcagatc 


cctcgcctaa 


ctgctcagtg 


taccaggact 


27660 


tggatgccag 


gctgtggctc 


gctcttgtta 


aaagtggcga 


catggtgcat 


ggaagcattg 


27720 


ccctaaaagc 


cctaaaaggg 


acgttgctaa 


atcctacagc 


cagctacatt 


tccattgtga 


27780 


tatattttta 


cagcaacgga 


gtcaggcgta 


ccaactatcc 


aacgtttgac 


aacgaaggca 


27840 


ccttagctaa 


cagcgccact 


tggggatacc 


gacaggggca 


atctgctaac 


actaatgtga 


27900 


ccaatgccac 


tgaatttatg 


cccagctcaa 


gcaggtaccc 


cgtgaataaa 


ggagacaaca 


27960 


ttcaaaatca 


atctttttca 


tacacctgta 


ttaaaggaga 


ttttgctatg 


cctgtcccgt 


28020 


tccgtgtaac 


atataatcac 


gccctggaag 


ggtattccct 


taagttcacc 


tggcgcgttg 


28080 


tagccaatca 


ggcctttgat 


attccttgct 


gttcattttc 


atacatcaca 


gaataaaaaa 


28140 
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ccactttttc 


attttaattt 


ctttttattt 


tacacgaaca 


gtgagacttc 


ctccaccctt 


28200 


ccatttgaca 


gcatacacca 


gcctctcccc 


cttcatagca 


gtaaactgtt 


gtgaatcagt 


28260 


ccggtatttg 


ggagttaaaa 


tccaaacagt 


ctctttggtg 


atgaaacgtc 


gatcagtaat 


28320 


ggacacaaat 


ccctgggaca 


ggttttccaa 


cgtttcggtg 


aaaaactgca 


caccgcccta 


28380 


caaaacaaac 


aggttcaggc 


tctccacggg 


ttatctcccc 


gatcaaactc 


agacagggta 


28440 


aaqqtqcqqt 

3 3 ^3 33 


ggtgttccac 


taaaccacgc 


aggtggcgct 


gtctgaacct 


ctcggtgcga 


28500 


ctcctgtgag 


gctggtaaga 

3 3 3 3 


agttagattg 


tccagtagcc 


tcacagcatg 


tatcatcagt 


28560 


ctacgagtgc 


gtctggcgca 

3 3 3 3 


gcagcgcatc 


tgaatctcac 


tgagattccg 


gcaagaatcg 


28620 


cacaccatca 


caatcaggtt 


gttcatgatc 


ccatagctga 


acacgctcca 


gccaaagctc 


28680 


attcgctcca 


acagcgccac 


cgcgtgtccg 


tccaacctta 


ctttaacata 


aatcaggtgt 


28740 


ctgccgcgta 


caaacatgct 


acccacatac 


agaacttccc 


ggggcaggcc 


cctgttcacc 


28800 


acctgtctgt 


accagggaaa 


cctcacattt 


atcagggagc 


catagatggc 


cattttaaac 


28860 


caattagcta 


ataccgcccc 


accagctcta 


cactgaagag 


aaccgggaga 


gttacaatga 


28920 


cagtgaataa 


tccatctctc 


ataacccctg 


atggtctgat 


gaaaatctag 


atctaacgtg 


28980 


gcacaacaaa 


tacacacttt 


catatacatt 


ttcataacat 


gtttttccca 


ggccgttaaa 


29040 


atacaatccc 


aatacacggg 


ccactcctgc 


agtacaataa 


agctaataca 


agatggtata 


29100 


ctcctcacct 


cactgacact 


gtgcatgttc 


atattttcac 


attctaagta 


ccgagagttc 


29160 


tcctctacag 


cagcactgct 


gcggtcctca 


caaggtggta 


gctggtgatg 

3 33 3 3 


attgtagggg 


29220 


gccagtctgc 


agcgataccg 


tctgtcgcgt 


tgcatcgtag 


accaggaacc 


gacgcacctc 


29280 


ctcgtacttg 


tggtagcaga 


accacgtccg 


ctgccagcac 


gtctccacgt 


aacgccggtc 


29340 


cctgcgtcgc 


tcacgctccc 


tcctcaatgc 


aaagtgcaac 


cactcttgta 


atccacacag 


29400 


atccctctcg 


gcctccgggg 

3 ZJ ZJ ZJ ZJ 


tgatgcacac 


ctcaaaccta 


cagatgtctc 


ggtacagttc 


29460 


caaacacgta 


gtgagggcga 

ZJ Zt 3 3 3 3 


gttccaacca 


agacagacag 


cctgatctat 


cccgacacac 


29520 


tggaggtgga 

ZJ ZJ 33 33 


ggaagacacg 


gaagaggcat 


gttattccaa 


gcgattcacc 


aacgggtcga 


29580 


aatgaagatc 


ccgaagatga 


caacggtcgc 


ctccggagcc 


ctgatggaat 


ttaacagcca 


29640 


gatcaaacgt 


tatgcgattc 


tccaagctat 


cgatcgccgc 


ttccaaaaga 


gcctggaccc 


29700 


gcacttccac 


aaacaccagc 


aaagcaaaag 


cactattatc 


aaactcttca 


atcatcaagc 


29760 


tgcaggactg 


tacaatgcct 


aagtaatttt 


cgtttctcca 


ctcgcgaatg 


atgtcgcggc 

3 3 3 3 


29820 


agatagtctg 


aaggttcatc 


ccgtgcaggg 


taaaaagctc 


cgaaagggcg 


ccctctacag 


29880 


ccatgcgtag 


acacaccatc 


atgactgcaa 


gatatcgggc 


tcctgagaca 


cctgcagcag 


29940 


atttaacaga 


tcaaggtcag 


gttgctctcc 


gcgatcacga 


atctccatcc 


gcaaggtcat 


30000 


ttgcaaaaaa 


ttaaataaat 


ctatgccgac 


tagatctgtc 


aactccgcat 


taqqaaccaa 


30060 


atcaggtgtg 


gctacgcagc 


acaaaagttc 


cagggatggt 


gccaaactca 


ctagaaccgc 


30120 


tcccgagtaa 


caaaactgat 


gaatgggagt 


aacacagtgt 


aaaatgtgca 


accaaaaatc 


30180 


actaaggtgc 


tcctttaaaa 


agtccagtac 


ttctatattc agtccgtgca 
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caactgtgcg 


ggaatatgca 


caacaaaaaa 


aaataaaaag 


aatcattaaa 


ctaaagaagc 


cagcagcaga 


caggcaaccg 


gctgtccccg 


aaaaagaaca 


acagaaactt 


cccaccatgt 


tacccccctc 


acattcatgt 


ccgccacaga 


atccaacgac 


agctgcaaag 


acagcaaaac 


cggtgaaaaa 


agcacataca 


tattagaata 


tcccagcaaa 


tgcacataaa 


tatgttcatc 


ccacgaaaaa 


atcgagctaa 


aattcaccca 


aatgacgcta 


ataccgcacc 


acccacgacc 


aaatccagcg 


ccgtcagcac 


ttccgcaatt 


tttcacattc 


ccacacacac 


ccgcgccctt 


gcacgtcacc 


ccggccccgc 


ctcgctcctc 


gaataaggta 


tattattgat 


gatg 



<210> 6 

<211> 34115 

<212> DNA 

<213> simian adenovirus SV-39 

<400> 6 



catcatcaat 


ataacaccgc 


aagatggcga 


gttacgcgac 


ctttgtcttg 


ggaacgcgga 


cggggcgggg 


cgggcgtgtc 


gcgcggcggt 


ttttattcat 


tttgcaagtt 


tttctgtaca 


aaaagtgaaa 


aatgccgagg 


tagtcaccgg 


tacccgctga 


cgtgtgggtt 


tcggtctcta 


aaaggtcccc 


attttatgac 


tccacgtcag 


accgtcaaga 


ggccactctt 


gagtgccggc 


actgtgaaaa 


aatgaggaac 


ttcttgctat 


agctgttgga 


ggacattgtt 


accggagctc 


tttgtgaaga 


ttttagtctt 


catgatctct 


aggatgaatg 


gctggagtct 


gtggatgggt 


atttgccacc 


acctcacaac 


tctcacactg 


catcaggtga 


acttgacttg 


gcttgttacg 


acagcgggat 


cagcgatccc 


acggctttta 


aagatgatga 


tggcgatgat 


ggatttcgac 


gtaagtcctg 


tgaataccac 


cgggatcgta 



aatagggcgg 


ctcagataca 


tgttgaccta 


30300 


ttggcgaacg 


gtgggataaa 


tgacacgctc 


30360 


ggaaccgcgg 


taaaattcat 


ccgaatgatt 


30420 


actcggttgg 


atctcctgag 


cacacagcaa 


30480 


aaaaaaacgt 


cccagatacc 


cageggggat 


30540 


aatccctctg 


ggagcgatca 


caaaatcctc 


30600 


accctgttgc 


tggggcaaaa 
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tatacttttc 


catcagacta 


22260 


tgcataatct 


ccatgccctt 


ttcccaggcg 


caaacaatct 


tggtgctaca 


cgggttaggt 


22320 


atggccaaag 


tggttggcct 


ctgaggcggc 


gcttgttctt 


cctcttgagc 


cctctcccga 


22380 


ctgacggggg 


ttgaaagagg 


gtgccccttg 


gggaacggct 


tgaacacggt 


ctggcccgag 


22440 


gcgtcccgaa 


gaatctgcat 


cgggggattg 


ctggccgtca 


tggcgatgat 


ctgaccccgg 


22500 


ggctcctcca 


cttcgtcctc 


ctcgggactt 


tcctcgtgct 


tttcggggga 


cggtacggga 


22560 


gtagggggaa 


gagcgcggcg 


cgccttcttc 


ttgggcggca 


gttccggagc 


ctgctcttga 


22620 


cgactggcca 


ttgtcttctc 


ctaggcaaga 


aaaacaagat 


ggaagactct 


ttctcctcct 


22680 


cctcgtcaac 


gtcagaaagc 


gagtcttcca 


ccttaagcgc 


cgagaactcc 


cagcgcatag 


22740 


aatccgatgt 


gggctacgag 


actccccccg 


cgaacttttc 


gccgcccccc 


ataaacacta 


22800 


acgggtggac 


ggactacctg 


gccctaggag 


acgtactgct 


gaagcacatc 


aggcggcaga 


22860 


gcgttatcgt 


gcaagatgct 


ctcaccgagc 


gactcgcggt 


tccgctggaa 


gtggcggaac 


22920 


ttagcgccgc 


ctacgagcga 


accctcttct 


ccccaaagac 


tccccccaag 


aggcaggcta 


22980 


acggcacctg 


cgagcctaac 


cctcgactca 


acttctaccc 


tgcctttgcc 


gtgccagagg 


23040 


tactggctac 


gtaccacatt 


tttttccaaa 


accacaaaat 


ccctctctcg 


tgccgcgcca 


23100 


accgcaccaa 


agccgatcgc 


gtgctgcgac 


tggaggaagg 


ggctcgcata 


cctgagattg 


23160 


cgtgtctgga 


ggaagtccca 


aaaatctttg 


aaggtctggg 


ccgcgacgaa 


aagcgagcag 


23220 


caaacgctct 


ggaagagaac 


gcagagagtc 


acaacagcgc 


cttggtagaa 


ctcgagggcg 


23280 


acaacgccag 


actggccgtc 


ctcaaacggt 


ccatagaagt 


cacgcacttc 


gcctaccccg 


23340 


ccgttaacct 


ccctccaaaa 


gttatgacag 


cggtcatgga 


ctcgctgctc 


ataaagcgcg 


23400 


ctcagccctt 


agacccagag 


cacgaaaaca 


acagtgacga 


aggaaaaccg 


gtggtttctg 


23460 


atgaggagtt 


gagcaagtgg 


ctgtcctcca 


acgaccccgc 


cacgttggag 


gaacgaagaa 


23520 


aaaccatgat 


ggccgtggtg 


ctagttaccg 


tgcaattaga 


atgtctgcag 


aggttctttt 


23580 


cccacccaga 


gaccctgaga 


aaagtggagg 


aaacgctgca 


ctacacattt 


aggcacggct 


23640 


acgtgaagca 


agcctgcaag 


atttccaacg 


tagaacttag 


caacctcatc 


tcctacctgg 


23700 


ggatcttgca 


cgaaaaccgc 


ctcggacaaa 


acgtgctgca 


cagcacactg 


aaaggagaag 


23760 
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ctatatacaa 
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tcctagcgct 


agtgtacacc 


tqqcagagcg 
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aaataaaaat: 


ctaacaacaa 
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tacctaaaaa 


acaaaaacct 


caaaaaactt 
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aaaaaqctqc 


23880 


taatacartc 


caaaaaaaca 


ctataaacca 


attttaacaa 


gcgcaccgcc 


acacqaqacc 


23940 


taactaatat 
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tatttttcct 
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cccaaactaa 


tacaaactct 


ccaaaaaaaa 

23 2? 23 ****** 23 23 


ctgecagatt 


24000 


ttataaatca 


aaacatctta 


caaaacttcc 


actctttcat 


cttaaaacac 
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tegggaatet 


24060 


tlQCCCQCCB.C 


taactacacc 
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ctacccacaa 


attttatacc 


tctccactac 
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cgcgaatgcc 


24120 


caeca ccact 


ataaccatac 

23 *■ **§ *2? *^ ****** 


acttacttgc 


ttaaactggc 


caactttcta 


atgttccact 


24180 


ctaacctaac 

W**! ****** %■ *- 23 23 X»» 


aaaaaacatt 

** ** ** ** *3 ** 2J *» w 


aacaacaaaa 


aactactaaa 

yy*- L y^ «-**y** 


atgccactgc 


cgctgcaacc 


24240 


tatacacccc 

*J» ** W V» ** *-» *■» V* *-» 


ccaccactct 

%*• *-• ** *— *** *■ X* *» 


etagtatgea 


acactcccct 


actcaataaq 


acccagatca 


24300 


tcaatacctt 


taaaatccaa 


aaaccctcca 


acacaoaaaa 


caacaaacaa 


aaatctaaac 
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24360 


taaaactcac 


aaccaaacta 
y y" **y 


taaacctcca 


cctacttaca 


caaatttata 

Vj* ****** *» *» *w 23 *»** 


ccaaaaaact 

^* ^* ** 23 ** ** 23 ** *» 


24420 


atcacaccca 


ccaaattaaa 


ttttacaaaa 


accaatcaaa 


accacccaaa 


aacaaattaa 

** *j 23 *** *3 *-**** 


24480 


cqqettacat 


cattaegcag 


agcagcatag 


ttqaacaqtt 


geaagecatt 


aacaaagege 


24540 


gqcaaqaqtt 


tctcctaaaa 


aaaqqaaaaq 


qqqtctactt 


qqacccccaq 


accqqeqaqq 

»*^*«yy^y«*yy 


24600 


aactcaaegg 


accctcctca 


atCQcaaatt 

y Lv -y v - w yy *- w 


qtqtqcccca 


tgccgcccaa 


aaagaacacc 


24660 


tcacaataaa 


acataccaaa 
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gaeggaggaa 


aaaaaptaaa 

y**yy**y *»yy** 


acaatataaQ 
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caacagegaa 


24720 


acaaaaaaaa 


aaccataacc 
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caaaaaatac 
v y a yyyy 


aacaaaaaaa 

***** 23 Z3 23 23 ***** 23 


aaaacacaaa 

**23 23 **** j * ****** 2*# Z3 


aaaacaacaa 
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24780 


agtcttcgcc 


gaagaactct 


caccactacc 
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caaccaacca 
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cctcaaccca 
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24840 


agatcccgca 


cacacccgta 


aataaaataa 

y ** ^yyy** *-«*y 


caagaccaaa 


aaaccaaata 


agagaaaege 


24900 


tcgcccccgc 


caoQQctacc 


actcataaaa 

y u yy u y 


aaaacacaaa 

****** 23 ** ****** 


aactacatct 

**** *» 23 ^»** *■ *■ 


tatcatactt 


24960 


qctccaqtqc 


qqcaaaaacq 


tttcattcac 


ccQtaaatac 


ttacttttta 

w w 23 w w w w *» ** 


acaaaaaaat 


25020 


qqccatcccc 


cqtaacatcc 


tccactacta 


ccatcactct 


tacaactcca 


aaacaaacaa 


25080 


ctaagaaaac 


acaacaatta 


ccaacaaaaa 
*- c yy*-yyy u y 


aactacatct 
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caacacccaa 

******* 23 *■*> 23 ^— • *— V *j ** 


aaacccccaa 

23 ** ** *j#> *j* *mi ****! 


25140 


ccaccaaaaa 


gctccgaaac 


cacatatttc 


ccaccctcta 


cactatcttt 


caacaaaacc 


25200 


qqqqqcaqca 


gcaagaactg 


aaaataaaaa 


accgcacgct 


aaaatcactt 

y yy *»^*y *- ^ 


acccaaaact 

** *a> *j* V«*4 *****3 *«* *■ 


25260 


gcctctatca 


caagagegaa 


aaacaactac 


agcgaaccct 


aaaaaacaca 

yy yy wx, y >,u 


aaaacactat 

23 ******! *** 2a 23 *» 


25320 


tccagaagta 


ctgcgcgacc 


accctaaata 


actaaaaaaa 

***i* ************** 23 


cccacacaca 

*- *"y *-y *«»y ^y 


aaacttcaaa 

23 23 *"* *■* *■ *• *** ****** 


25380 


ccqtctqaca 


tcaccagccg 


cgcgccaaaa 


taaacaaaaa 

v* 23 **23 ****** 23 ** 


aattcccaca 


ccttacatat 

*j* *v *- *» ** *— ** ** 23 


25440 


qaaattacca 

*• *» ** *^ V* ** 


occacaaata 


aaattaacca 

yy** *» *»**y *— *~y 


ccaacaccac 
yy y y 


ccaaaattac 


tccacaaaaa 


25500 


tqaactaact 

W *J ****** fcjJjV' V> 


caacaccaaa 


ccccacataa 


tttcccacat 

** *• %m *j« *j0 *j« *3 23 


aaacaacatt 


CQCQCCCB.CC 


25560 


gcaatcagct 


attgttagaa 


caqactactc 


taaccaccac 

*« 23 ** *^ 23 ****v» 


accccataat 


aacctaaacc 


25620 


ctcccagctg 


gccagctgcc 


ctoatatacc 


aaaaaacacc 

**"23 Z3 ******** 23 


tccacccacc 


aacatacttt 


25680 


tgccccgtga 


cgcccaggcg 


gaagtccaga 


tgactaaege 


gggcgcgcaa 


ttagcgggcg 


25740 


gatcceggtt 


teggtacaga 


gttcacggcg 


ccgcacccta 


tageccaggt 


ataaagaggc 


25800 


tgatcattcg 


aggcagaggt 


gtccagctca 


acgacgagac 


agtgagctct 


tcgcttggtc 


25860 
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tacgaccaga 


cggagtgttc 


cagctcgcgg 


gctcgggccg 


ctcttcgttc 


acgcctcgcc 


25920 


aggcatacct 


gactctgcag 


agctctgcct 


ctcagcctcg 


ctcgggagga 


atcggacccc 


25980 


ttcagtttgt 


ggaggagttt 


gtgccctcgg 


tctactttca 


gcctttctcc 


ggatcgcccg 


26040 


gccagtaccc 


ggacgagttc 


atccccaact 


tcgacgcggt 


gagtgactct 


gtggacggtt 


26100 


atgactgatg 


tcgagcccgc 


ttcagtgcta 


gtggaacaag 


cgcggctcaa 


tcacctggtt 


26160 


cgttgccgcc 


gccgctgctg 


cgtggctcgc 


gacttgagct 


tagctctcaa 


gtttgtaaaa 


26220 


aacccgtccg 


aaaccgggag 


cgctgtgcac 


gggttggagc 


tagtgggtcc 


tgagaaggcc 


26280 


accatccacg 


ttctcagaaa 


ctttgtggaa 


aaacccattt 


tggttaaacg 


agatcagggg 


26340 


ccttttgtaa 


tcagcttact 


ctgcacctgt 


aaccatgttg 


accttcacga 


ctattttatg 


26400 


gatcatttgt 


gcgctgaatt 


caataagtaa 


agcgaattct 


taccaagatt 


atgatgtcca 


26460 


tgactgttcc 


tcgccactat 


acgatgttgt 


gccagtaaac 


tctcttgtcg 


acatctatct 


26520 


gaactgttcc 


ttttggtccg 


cacagcttac 


ttggtactac 


ggtgacaccg 


tcctttctgg 


26580 


ctcactgggc 


agctcacacg 


gaataacact 


tcacctcttt 


tcgccgtttc 


gatacggaaa 


26640 


ctacagctgt 


cgtgccggta 


cctgcctcca 


cgttttcaat 


cttcagccct 


gtccaccgac 


26700 


caaacttgta 


tttgtcgact 


ctaagcactt 


acagctcaac 


tgcagcattc 


taggccccag 


26760 


tatcttgtgg 


acatacaata 


aaatcaggtt 


ggtggaattt 


gtctactacc 


cacccagcgc 


26820 


ccgcggtttt 


ggggaaattc 


ctttccagat 


ctactacaac 


tatcttgcca 


cacattatgc 


26880 


aagtcaacag 


caactaaact 


tgcaagcacc 


cttcacgcca 


ggagagtact 


cctgtcacgt 


26940 


aggctcctgc 


acagaaactt 


ttattctctt 


caacagatct 


tctgccattg 


aacgcttcac 


27000 


tactaactac 


tttagaaacc 


aagttgtgct 


tttcactgac 


gaaaccccta 


acgtcaccct 


27060 


ggactgtgca 


tgtttttctc 


atgacaccgt 


aacttggact 


cttaacaata 


ctctctggct 


27120 


cgcgttcgat 


aaccaaagct 


tgattgttaa 


aaattttgat 


ttaaccttta 


ctaaaccctc 


27180 


tcctcgcgaa 


atagttatct 


ttgctccttt 


taatccaaaa 


actaccttag 


cctgtcaggt 


27240 


tttgtttaag 


ccttgccaaa 


caaactttaa 


gtttgtttat 


ttgcctccgc 


aatctgtcaa 


27300 


actcatagaa 


aaatacaaca 


aagcgcccgt 


cttggctcct 


aaaaccttct 


accactggct 


27360 


aacctacacg 


gggctgtttg 


cactaattgt 


ttttttccta 


attaacattt 


ttatatgttt 


27420 


cttgccttcc 


tccttctttt 


cgcgaacacc 


gttgccgcag 


aaagacctct 


ccttattact 


27480 


gtagcgcttg 


ctatacaaaa 


ccaagagtgg 


tcaaccgtgc 


tctcaatcta 


ttttcaattt 


27540 


ttcattttgt 


ccttaatact 


ttctcttatt 


gtcgttaaca 


atgatctgga 


gcattggtct 


27600 


cgcctttttt 


tggctgctta 


gtgcaaaagc 


cactattttt 


cacaggtatg 


tggaagaagg 


27660 


aactagcacc 


ctctttacga 


tacctgaaac 


aattaaggcg 


gctgatgaag 


tttcttggta 


27720 


caaaggctcg 


ctctcagacg 


gcaaccactc 


attctcagga 


cagacccttt 


gcatccaaga 


27780 


aacttatttt 


aaatcagaac 


tacaatacag 


ctgcataaaa 


aactttttcc 


atctctacaa 


27840 


catctcaaaa 


ccctatgagg 


gtatttacaa 


tgccaaggtt 


tcagacaact 


ccagcacacg 


27900 
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yaaL ILL LaL 


tttaatrtna 

LLLaaLLLya 


ranttattaa 
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Uy L UU L L LLL 


attcctatct 

U L LLL LU LLL 


ataaatttaa 

y y y *— Luy 
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etc c ra o T t T 

L, LLLLQy l_ l_ L 


rtttrtaaaa 

L.LLLL LyClQO 


cctactattt 

LL LUL Ly L L L 


aattartata 

U,U w WU V» w U LU 


aactacacta 

UUL Ly LUV. LV* 


aaaatcacct 


28020 


tcaeaccaee 

LLULuLLULL 


ataatctaca 

U LUU LL. LUV.U 


atcacacaca 

U LLULULULU 


atcaccttaa 

(A L\>uv\i L L y y 


attttaaacc 


taaaattttc 


28080 


tcracarata 

lllululu Ly 


cettcacaat 

LL L LV.yL.UU L 


ttctcacaca 

L LL LLULyLU 


aaL*taccatc 

uy l l 14 w l» y l w 


tctaacataa 


acaaacaatt 


28140 


taarttttac 

Ly y L L L L LuV. 


tatcctttcc 

LCILL.LL L LLL. 


acaaactata 

uLyuuL Ly Ly 


caaaataatt 

L.y UUU LUU L L 


aaaaccaaat 


ataaaccaaa 


28200 


ctactttact 

l_ Lav. L V_ LuU V- 


taeattaeea 

LQLQ L Uy LLu 


ttantataat 

l Ly y Ly luu l 


cat tattLac 

Ly LLy ll Lyv. 


ctttacttta 

V» L L l y LLL Ly 


ttattaaaaa 

l lu l Lyyyyy 


28260 


atatatttat 
y ^*y *»y u ^ lu l 


ttataeattc 

L L y LULU L LL 


aaaaaaaaat 
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at "tactctca 

U L LyL. L L, LLy 


ctatactcct 

V* Ly LyL LL.V- L 


acaattacaa 


28320 


aacaaaanaa 


aaaattaaaa 

uy UU L LUUUU 


tctctacact 

V L V L I>U LU L L 


ttattaatat 


tttccaaaaa 

W W W- \n VLA\4L4V4L4 


tggcaaaact 


28380 


aacactccta 


cttttacttc 

L L L L Ly L L L L 


LLULy LLyy l 


aacacttttt 

yULM w L L L L L 


accatcactt 


tttctaccac 


28440 


caceacarte 


aaacctcaat 

y UU L. L L LUU L 


y L L Ly LLULL 


aattaaaatc 

yy tyuuy l v— 


tactttatct 

LUL LLL y L W L 


acatattact 

ULy Ly l l y l l 


28500 


y L y**' *-y ^-y L 


aarntttnra 
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atataarato 
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ttttaccttt 

LLL LULL LLL 


atttttcttc 

y LLL LLLLLL 


aatacattaa 

uy l y l u l Ly u 


28560 


rtarttrtfici 

l l c\ l l l l Ly y 


nt ranartrt 

y LLayaL ll>l 


artarroraa 

UL LULLy Luy 


acacacacct 
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caatatcaaa 

LUy LU L LUUU 


atcaacaaat 

U LLUUVUUU L 


28620 


taccaaarta 


l LLyy LL. LyL 
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ttatatttta 
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ccctaat ttt 
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ttttcacctt 

LLL LLULL L L 


28680 


acttarnrtt 
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ataattttca 
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cttcacvtcaa 


ttttaaaaaa 
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cacaatactt 
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caacccacac 

LyuLLLyLyL 


28740 


ctctccaaca 

l l l l l, \_ w. a l. y 


actaaataat 

ul LyyuLyuL 
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attaccacac 
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ttaaaacatt 


taaacttttt 
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aataatttta 
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ctttaratta 
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tttccaatta 

L L LLLUVj L LU 


rarrtartcr 

L U L L LCLL LLL 


occacctccc 


CCQClCCtL"tCQ 
yy y 
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29040 
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aaccatacaa 

UULLU LULua 
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aatataaaca 
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tatattataa 
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29100 
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uuul v>aay y y 


tnaaarraan 
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L UUU L UUU L L 
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U.UUU L L LUL L 


taaaatctto 
L yyyy *- w y 


29640 


tccctaaaaa 


tatccaaaec 

L U LLLUUULL 


tttaactttt 

L L LUUL L L L L 


aatacctcca 

UU Ly L L L L LU 


aaoctctaaa 

uy y l ll Luuy 


rctaartatt 

LLLyyLLuLL 


29700 


aatrraanat 

yy LLLuyy a l 


taaaaattra 

LClClaQO L L LU 


aoat*aot*aaa 

uy u Ly y l uuu 


r t a o t n n a aa 
(-Lay t-yyyyy 


aodoacaaor 

uy y y uLuuy l 


a a 1 1 f*"t"1~ n ra 

uu LLLL Ly Lu 


29760 

C _7 / \J\J 


aacctaccac 

UULL L>y LLUL 


ttcaaatcac 

L LLUUU LLUL 


caacaacaca 

LUU LUU LULU 


atttrartac 

U L L LLCLL LUL 


attttaaaaa 

y l l l Lyy y uu 


racarttnrr 

LU Lu L L Ly LL 


29820 


ttgaatgaca 


ataatgaact 


ccaaaccaca 


ctaaaatctt 


catcgcccct 


taaaatcaca 


29880 


gaccagactc 


tgtcccttaa 


cataggggac 


agecttgeaa 


ttaaagatga 


caaactagaa 


29940 


agcgctcttc 


aagcgaccct 


cccactctcc 


attagcaaca 


acaccatcag 


cctcaacgtg 


30000 
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ggcaccggac 


tcaccataaa 


tggaaacgtt 


ttacaagctg 


ttcccttaaa 


tgctctaagt 


30060 


cccctaacta 


tttccaacaa 


taacatcagc 


ctgcgctatg 


gcagttccct 


gacggtgctt 


30120 


aacaatgaac 


tgcaaagcaa 


cctcacagtt 


cactcccctt 


taaaactcaa 


ctccaacaac 


30180 


tcaatttctc 


tcaacactct 


atctccgttt 


agaatcgaga 


atggtttcct 


cacgctctat 


30240 


ttgggaacaa 


aatctggctt 


gctagttcaa 


aacagtggct 


taaaagttca 


agcgggctac 


30300 


ggcctgcaag 


taacagacac 


caatgctctc 


acattaagat 


atctcgctcc 


actgaccatt 


30360 


ccagactcgg 


gctcagaaca 


aggcattctt 


aaagtaaaca 


ctggacaggg 


cctaagtgtg 


30420 


aaccaagctg 


gagcgcttga 


aacatcccta 


ggaggtggat 


taaaatatgc 


tgataacaaa 


30480 


ataacctttg 


atacaggaaa 


cggactgaca 


ttatctgaaa 


ataaacttgc 


agtagctgca 


30540 


ggtagtggtc 


taacttttag 


agatggtgcc 


ttggtagcca 


cgggaaccgc 


atttacgcaa 


30600 


acactgtgga 


ctacggctga 


tccgtctccc 


aactgcacaa 


ttatacagga 


ccgcgacaca 


30660 


aaatttactt 


tggcgcttac 


cattagtggg 


agccaagtgc 


tggggacggt 


ttccattatt 


30720 


ggagtaaaag 


gccccctttc 


aagtagcata 


ccgtcagcta 


ccgttacagt 


acaacttaac 


30780 


tttgattcca 


acggagccct 


attgagctcc 


tcttcactta 


aaggttactg 


ggggtatcgc 


30840 


caaggtccct 


caattgaccc 


ttaccccata 


attaatgcct 


taaactttat 


gccaaactca 


30900 


ctggcttatc 


ccccgggaca 


agaaatccaa 


gcaaaatgta 


acatgtacgt 


ttctactttt 


30960 


ttacgaggaa 


atccacaaag 


accaatagtt 


ttaaacatca 


cttttaataa 


tcaaaccagc 


31020 


gggttttcca 


ttagatttac 


atggacaaat 


ttaaccacag 


gagaagcatt 


tgcaatgccc 


31080 


ccatgcactt 


tttcctacat 


tgctgaacaa 


caataaacta 


tgtaaccctc 


accgttaacc 


31140 


cgcctccgcc 


cttccatttt 


attttataaa 


ccacccgatc 


caccttttca 


gcagtaaaca 


31200 


attgcatgtc 


agtaggggca 


gtaaaacttt 


tgggagttaa 


aatccacaca 


ggttcttcac 


31260 


aagctaagcg 


aaaatcagtt 


acacttataa 


aaccatcgct 


aacatcggac 


aaagacaagc 


31320 


atgagtccaa 


agcttccggt 


tctggatcag 


atttttgttc 


attaacagcg 


ggagaaacag 


31380 


cttctggagg 


attttccatc 


tccatctcct 


tcatcagttc 


caccatgtcc 


accgtggtca 


31440 


tctgggacga 


gaacgacagt 


tgtcatacac 


ctcataagtc 


accggtcgat 


gacgaacgta 


31500 


cagatctcga 


agaatgtcct 


gtcgccgcct 


ttcggcagca 


ctgggccgaa 


ggcgaaagcg 


31560 


cccatgttta 


acaatggcca 


gcaccgcccg 


cttcatcagg 


cgcctagttc 


ttttagcgca 


31620 


acagcgcatg 


cgcagctcgc 


taagactggc 


gcaagaaaca 


cagcacagaa 


ccaccagatt 


31680 


gttcatgatc 


ccataagcgt 


gctgacacca 


gcccatacta 


acaaattgtt 


tcactattct 


31740 


agcatgaatg 


tcatatctga 


tgttcaagta 


aattaaatgg 


cgccccctta 


tgtaaacact 


31800 


tcccacgtac 


aacacctcct 


ttggcatctg 


ataattaacc 


acctcccgat 


accaaataca 


31860 


tctctgatta 


atagtcgccc 


cgtacactac 


ccgattaaac 


caagttgcca 


acataatccc 


31920 


ccctgccata 


cactgcaaag 


aacctggacg 


gctacaatga 


cagtgcaaag 


tccacacctc 


31980 


gttgccatgg 


ataactgagg 


aacgccttaa 


gtcaatagtg 


gcacaactaa 


tacaaacatg 


32040 
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taaataatat 


ttraaraaot 

l i_ l_ aa. v_ a ay l 


a era r teat a 

y LLaL uuy lg 


L y a yy ^y^y ^ 


atcatatccc 

a l La Ly llll 


aaaaaacaaa 
yyy yyy 


32100 


rractrrata 


aarartcicaa 

aaLaL ty VQO 


aarcaacaca 


L L L L a L La L L 


CCCCQCtlCQQ 
LLLLyLaLyy 


cactcacatc 

LU.L W.LU.LU b>L 


32160 


atarataata 


ttcatatcac 

L L L. a L CI l_Vwdl_ 


aotrroflaao 

ay iv.Lyy aay 


ctoaaaacaa 

l LyayyaLaa 


aaaaaaatct 

yy aaaay ll l 


caaaaacatt 


32220 


ttcataaaac 

t- LLa lquu y V. 


aataataoat 
yy La y L yyy L 


actcettata 

av> lv>^ l i- y La 


aaaattcaat 

yyyy ^ l l uy l 


CQQCeiCCQClt. 

LyyLULLyy i» 


atctcctcac 

U V. L Vj L L L VI L 


32280 


cttctaaacc 


ataacacaca 

u uaa\<a^.auu 


aattaaaatc 

ay l Luaya l l 


taatttcaaa 

l y a l l l Laay 


atactttcta 

y l a L L L LL Ly 


aataaaaacc 

UH vy VIVVVkV&LL 


32340 


aaatactttc 


ccaacaatat 


ate co at ate 

vi v. l v.y u *- y w l 


ttcaatcccc 

\* bvyy w v>l ll 


acatcaataa 

y l y k.Lyy wwy 


cactccttac 

Ly L V> V w Ly L 


32400 


aatacacaca 


aaacaaccac 


tcacacaaac 


ccaaaaaaca 


attttccaca 

y Www W LLVl Ly 


aacaataaca 

y vwLyy v> vj u\»u 


32460 


aattaatccc 

Uy L Laa l l l l 


cctcaatetc 

L v. Lv<a^ L L L l 


aaaaccaata 


taatttcttc 

Lay L L LL L LL 


cacaataaca 

LULUy bUUwU 


taaaccaaac 

Vj VVVJ y LLVLUVkL 


32520 


ccaaccaaaa 


aacacaaact 

auLHwaay l l 


aacacatccc 

yyuaLy llll 


attcaacaao 

y l LLaaLyyy 


aaaacaaaaa 

ay y a Laay y a 


aacaaaaaca 

ay Lay ay y La 


32580 


aaaa cat aaa 

yayyLaLayy 


raaaaraara 

l aaa y v. a a l cl 


aaatttttat 

y aa l l l u La l 


l l Laa l Lyy l 


cacataacac 

La Ly Lay LaL 


ttcaaacacc 

l LLaaaLaLL 


32640 


aaatrarnta 

ayy LLaLy La 


aa Lyy cay i_y 


a trttaaatt 

aLLLLyyy l l 


l l l Ly a Lyy a 


arataaraar 

a l a Laa Lay l 


aaaatraaac 

aay a l l aa a l 


32700 

J C i \J\J 


ataaaaraat 

ci tyaya^ya l 


trtraanntn 

i- V- l Laay y i_y 


attaarrara 


a ft* on a a t"Ta 

y l l y y aa l l a 


aatrrtr car 

aa LLL LLLaL 


arararattt 

y Ly LaLa lll 


32760 

J C 1 \3\J 


aaaaacacra 


oraataraaa 

y L.au. l av_aaa 


aarr raattt 

ay v* LLy y l l l 


trt* ccoao at 

ll LLLyyya l 


ctatcataar 

l La l l a Lay l 


aacaraatca 

ay La Lay LLa 


32820 

J Cm Ll 


tcaattaatc 

uv.uul Lay 


rcaaataatt 

L.v.aay ma l l 


ttrrratttr 

L LLLLy L L L V- 


raatrtatta 

Laa ll Ly l l a 


taatttaraa 

Laa l l Ly Lay 


aataataccr 

a a Laa Ly lll 


32880 

J l— L» L» \J 


tataaatrra 


aacraarrat 

ay Lcyy v,ca v. 


aaraaaaaac 


traaataata 

LLaya Laa Ly 


cactttrrac 

LaL L L LLLaL 


atacattrat 

y Ly La l LLy l 


32940 


aaararacrr 


tcatcttatc 

L. v_ a i_ v_ i_ i_ y l l 


aatrraaaaa 

aa l l maaaa 


atcttcttct 

y LL L LL L LL L 


taaaaaarrt 

Ly ay aaaLL l 


ataataaatt 

y Lay Laaa l l 


33000 


aaaaatcacr 


anattaaart 


raa tare tar 

cy a lull l a l 


atcrraaaar 

a LLLLyy ay l 


ttcattctca 

L L La L LL LLa 


acatararta 

y l a l y l a l Ly 


33060 


raaataatrr 

Laaa Lya L V v. 


ayuay a LLay 


aaranraatt 

nuL.ay Laa l l 


ay Lay l Lay l 


tratrrrron 

LLa L LLLLyy 


tttrraattr 

L L L L Lay L LL 


33120 

J J JL.C \J 


COaaattCCC 


acaacaatta 


tcartcaaaa 

l La l l Ly aaa 


cataaaacaa 

Ly l y y y a Laa 


atraaaataa 

a l Ly aaa Laa 


cataaartcr 

La Ly ay l lll 


33180 


cacataaaca 


aaaaccataa 

viviviy v«Ly i» w y 


aaccaataca 

yyLLuy Ly lu. 


ataatcacaa 

a Laa l l ul uy 


aaccaacaaa 

uuv,v>uyv>yyu 


aaaaaaatta 

aaaaaya l l y 


33240 


caactcatat 


ttcaaaaaac 


tctacaaatc 


aaaattcaac 

aaaa l l v>uy l 


tcatacaaat 

l La Ly Laaa l 


aacacaataa 

auLaLay Laa 


33300 


aatttacaat 

ay v. v. Ly Ly y l 


ataataarra 

a Lay LaaLLy 


aaaarrarar 

aaaaLLaLaL 


aaatroarnt 


traaaratrt 

l Laaa La lll 


raarttarrt 

Lyy L L La LL L 


33360 

J mt J \J\J 


aaaaaaaaaa 

uuaaaa y a a y 


rarattttta 

LaLO ILLL La 


aarrarantr 

aaLLaLay l l 


acttrrtnaa 

y l l lll Ly aa 


ra aaa ana a a 

Lay y ay y aaa 


tataataraa 

La Lyy uy v-y y 


33420 


cataaaaeca 


y aLy v.y cvhIm 


Lyy a l l LLLy 


ocaoaaccct 

y Lay ay lll l 


aataataraa 

y a Laa Lauay 


rraartataa 

LLay l Ly Lyy 


33480 

J J ™ L» L* 


ttaaacaaca 

l tuau cay v.a 


aaacetttaa 

aaav>v« i_ l Laa 


l, LLyyLaaLy 


attaaaatrt 

y Lyayy lll 


ccacataatr 

LLaLa Laa ll 


aarflcccaca 

ayLyLLLaLa 


33540 


aaaatcccat 

uuua LLLLa l 


ctcaaactta 

l LLy CLCL L L Ly 


rtrorataon 

l l Ly Ly Lay y 


aa art aaaa t 

y ay l Laaaa l 


aaccaatata 

y y l Lay l a l a 


nccccaTacic 

y LLLLa Lyy l 


33600 


acccaaacac 


taatctacaa 

Lac* l v_ uy won 


atatatnaaa 

y La l a l y ay a 


accaccccat 

y l l aL lll a l 


yy yyy ^* 


cacaaaatca 

La Laaaa l La 


33660 

J J l/li yj 


□tcaaaaaaa 


acaacatata 

aLuucy V- a La 


LaLLLLyy a l 


tacaaaaart 

Ly Laa aay l l 


attcaaacaa 

y l l l ayy Laa 


aLy LLLL Ly L 


33720 
j j * t-\j 


OGtCCCtCtC 
yy l\.v«v. ll v. l 


aatacaccaa 

yy La\>au^iay 


caaaacctca 

Laaay ll LLy 


aataaaacaa 

yy Laaay Lay 


ccataccaaa 

l l a l y l Laay 


cacttaccat 

Ly l l LaL Ly l 


33780 

J ml 1 L» \J 


accaaaaara 

ycv>aayuyv.y 


actcaaacaa 

a \— LLuy uv.y a 


aaaaatatac 

auaay l y l a l 


taaaacactc 
yyyy ll 


aaaacaaraa 

ay ay Lay Ly y 


ctatatactc 

l La l a LaL ll 


33840 


tacctataar 

La\.L i- y Ly aL 


otcaaaaacc 

y l Lauyuacv. 


aaaaatcaaa 

y a aay l l aaa 


aattcarrra 

ay l LLaLLLy 


y L y uy LLLy a 


aaaaat~r~ rnr 
aaaaaLLLyL 


33Q00 

■J J -7vv 


aaaaatrrar 

y UdClQ LLLaL 


rraaaaaocr 

LLaaaaay l l 


caraaaaaar 

Ly LyaaaaaL 


arttrrntat 

aL l LLLy iq l 


aaaatttrrn 

aaaa l l LLLy 


y y l LaL Lyy l 




gcgtcaccgc 


cgcgcgacac 


gcccgccccg 


ccccgcgctc 


ctccccgaaa 


cccgccgcgc 


34020 


ccacttccgc 


gttcccaaga 


caaaggtege 


gtaactccgc 


ccacctcatt 


tgcatgttaa 


34080 


ctcggtcgcc 


atcttgcggt 


gttatattga 


tgatg 






34115 
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<210> 7 

<211> 44 

<212> DNA 

<213> Artificial 

<220> 

<223> primer P5L 

<400> 7 

gcgcacgcgt ctctatcgat gaattccatt ggtgatggac atgc 44 

<210> 8 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> primer P5ITR 

<400> 8 

gcgcatttaa at cat cat ca ataatatacc tcaaac 36 

<210> 9 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> primer P5XTOP 

<400> 9 

gatacctagg aacgaggagg atttgatatt g 31 

<210> 10 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> primer P5XB0T 

<400> 10 

atgtacgcct ccgcgctcac 20 

<210> 11 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> primer P5E4 

<400> 11 

gatcgaattc ccactctgta ccccatctct g 31 

<210> 12 

<211> 31967 

<212> DNA 

<213> Simian adenovirus 



<220> 



Page 103 



WO 2005/001103 



PCT7US2004/016614 



<221> CDS 

<222> (13796).. (15322) 
<220> 

<221> CDS 

<222> (18257).. (21010) 
<220> 

<221> CDS 

<222> (27192) . . (29015) 
<400> 12 

catcatcata atatacctta tttgggaacg gtgccaatat gataatgagg aggcggggtt 60 

aggggtggag tgagggtggg gtgcggatga cgcgggcgcg gggcggggtg ggagtctgac 120 

gtggggcgcg gggtggagcg cgagggtgag ggcggggcga gggcggcggg cgcggcggaa 180 

ttgacgtaca cggtagtaag tttgagcgga aattaagtga attgggcgtg ttttttgtaa 240 

ctttttgacg tacacggtag taagtttgag cggaaattaa gtgaattggg cgtgtttttt 300 

gtaacttttt gacgtacacg gtagtaagtt tgagcggaaa ttaagtgaat tgggcgtgtt 360 

ttttgtaact ttttgacgta cacggtagta agtttgagcg gaaattaagt gaattgggcg 420 

tgttttttgt aactttttga cgtacacggt agtaagtttg agcggaaatt aagtgaattg 480 

ggcgtgtttt ttgtaacttt ttgacgtaca cggtagtaag tttgagcgga aattaagtga 540 

attgggcgtg ttttttgtaa ctttttgacg tacacggtag taagtttgag cggaaattaa 600 

gtgaattggg cgtgtttttt gtaacttttt ggtcattttg gcgcgaaaac tgagtaatga 660 

ggaagtgaga cggactctgc ccttttttac ggttgggagg gaaaactgct gatcagcgct 720 

gaactttggg ctctgacgcg gtggtttccc tacgtggcag tgccacgaga aggctcaaag 780 

tcctcgtttt attgtgtgct cagccttttt gagggtattt aaacaccgtc agaccgtcaa 840 

gaggccactc ttgagtgcga gcgagtagag ttttctcctc cgtcgctgcc gcggctgctc 900 

agtcttaccg ccaggatgcg aatgctgccg gagatcttca ccgggtcctg ggaagatgtt 960 

ttccagggac ttttagaatc tgaagacaac tttccccaac ctcctgagcc ggaggagcta 1020 

cctgaggttt cgcttcacga tctgtttgac gtggaggtgg agagccccga cggagatccg 1080 

aacgaggaag ctgttgatgg tatgttcccc gactggatga tatctcagag cgagagtgct 1140 

gaaggcagtg cggactcggg cgtttctggg gttggaaacc tggtggaggt ggatctggac 1200 

ttgaagtgtt acgaggaagg ttttcctcct agcgactcag agactgatga agcctcagaa 1260 

gcggaaggtc aagaggagtc tgtgtgtggt tatgtgaaga ttaatgaggg ggagaacctg 1320 

ctggtgttgg actgtccgga ccaacctgga catggctgtc gagcctgtga ctttcaccgg 1380 

gggaccagcg gaaacccgga agctatctgt gctttgtgct acatgcgtct gaacgagcac 1440 

tgcatataca gtgagtgtta ttcatgggtt atttatgggg aaagttgggg gaaagtcttg 1500 

agaaggggaa aagtttaaca tgtcattttt gtacttgata ggtccagttt cggacgctga 1560 

gggggattct gagtcccctg ctggtccttc ccagccctca ccctgctctt tgaccgccac 1620 

gcccgcacct gacctagtta gaccaacgcc ctgccgagtg tcctgtagac gacgtgcagc 1680 

tgttaattgc atagaagatt tattggcccc tgatgacgag aacgcacctt tgaacctgtg 1740 
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cctgaaacgc 


cctaagacat 


cttgagtgtt 


tatgctgtta 


ataaaagtgt 


tgacccttag 


1800 


atcctgtgtt 


tattccttgg 


gcgtgtgcgc 


gggtatataa 


agcagctgcg 


ggctggagtg 


1860 


ttagtttatt 


ctgatggagt 


actggagtga 


gctgcagaat 


taccagagcc 


tccggcgcct 


1920 


gctggagttg 


gcctctgcca 


gaacatccac 


ctgctggagg 


ttctgttttg 


gctcgactct 


1980 


cagtaacgtg 


gtgtatcggg 


tgaagcaaga 


gtacagctcg 


cgcttttctg 


agctgttggc 


2040 


ccgctacccg 


gctgtttttg 


tttctctgga 


tctaggccat 


cacgtttatt 


tccaagaagc 


2100 


tgtagtcaga 


tatttggatt 


tttctactcc 


cgggcgtgcg 


gtttctgcga 


ttgccttcat 


2160 


ctgctttgtg 


ctagatcgat 


ggagcgccca 


aacccgcctg 


agcccggggt 


acaccctgga 


2220 


ctacctgacc 


atgtccctgt 


ggagggccat 


gctgcggaag 


aggagggtct 


caggcttctc 


2280 


gccggcgcgg 


cctccgcacg 


gactggatcc 


ggtgctggag 


gagtcggagc 


tggaggagga 


2340 


ggagaacccg 


agggccggcc 


tggaccctcc 


ggcggaatag 


tgacggaacc 


ggaggatccc 


2400 


caagagggta 


ctagtcaggg 


gggagggggg 


ccgaagagaa 


agcgggatga 


agaggaggcg 


2460 


atggaccccg 


acaggtttct 


aaaagaactg 


actttaagct 


taatgtctaa 


gagaagaccc 


2520 


gagacggtgt 


ggtggtctga 


tttggagaag 


gagttccacc 


agggggagat 


gaatctgttg 


2580 


tacaagtatg 


ggtttgagca 


ggtgaagact 


cactggctgg 


aagcctggga 


ggactgggag 


2640 


atggctttta 


acatgtttgc 


caaggtggcg 


ctgcgcccgg 


acactattta 


caccgtgact 


2700 


aagacggtgg 


aaatccgcaa 


gcctgtgtat 


gtgattggca 


acggggccgt 


ggttcggttc 


2760 


cagaccaccg 


accgggtggc 


ctttaattgc 


tgtatgcaga 


acctgggccc 


gggggtgatt 


2820 


aatcttaatg 


gagtgacctt 


ttgcaatgtc 


agattcgcgg 


gggatggatt 


caacgggacg 


2880 


gtgtttgccg 


ccaccaccca 


gataacccta 


cacggggtgt 


tcttccagca 


tgtaggcggg 


2940 


gcttgtgtag 


atacctgggc 


gagggcctct 


gtgaggggct 


gcacctttgt 


gggctgttgg 


3000 


aaagcggtgg 


tgggtcgacc 


caagagtgtg 


ctgtctgtga 


agaaatgtgt 


gtttgagaga 


3060 


tgtctgatgg 


ccatggtggt 


ggagggccag 


ggtaggatcc 


gccataacgc 


gggctccgag 


3120 


aatacctgtt 


ttgccctgct 


gaagggtacg 


gcgaccgtga 


agcataacat 


gatctgcggg 


3180 


gtgggtcact 


cgcagctgct 


gacctgtgcg 


gatggcaact 


gccaggccct 


gcgcacggtg 


3240 


catgtggtgt 


cccaccggcg 


ccgcccctgg 


ccggtgtttg 
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tgctgatgat 


gtaattgaag 


taggcggttt 


teaaaeggeg 


gatggtggcg 


aggaggacga 


9900 


cgtctttggg 


cccggcctgc 


tggatgegea 


ggeggtegge 


catgccccag 


gcgtggctct 


9960 


ggcatcggcg 


caggtccttg 


tagtagtctt 


gcatgagtct 


etcgaegggg 


aegtegtett 


10020 
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cgtcggcccg 


gtcggccatg 


cgggtggagc 


cgaacccgcg 


caggggctgc 


agcagggcca 


10080 


ggtcggcgac 


cacgcgttcg 


gccagcacgg 


cctgctggat 


ctgggtgagc 


gtggtctgga 


10140 


agtcgtccag 


gtccacgaag 


cggtggtagg 


agcccgtgtt 


gatggtgtag 


gtgcagttgg 


10200 


ccatgacgga 


ccagttgacg 


acttgcatgc 


cgggctgggt 


gatctcggtg 


tagcggaggc 


10260 


gcgagtaggc 


ccgcgactcg 


aagacgtagt 


cgttgcaggt 


gcgcacgagg 


tactggtagc 


10320 


cgacgaggaa 


gtgcggcggc 


ggctcgcggt 


agaggggcca 


gcgcacggtg 


gcgggggcgc 


10380 


cgggggccag 


gtcctccagc 


atgaggcggt 


ggtagtggta 


gacgtagcgc 


gagagccagg 


10440 


tgatgccggc 


ggcggaggtg 


gcggcgcggg 


cgaagtcgcg 


gacgcggttc 


cagatgttgc 


10500 


gcaagggggc 


gaagcgctcc 


atggtgggca 


cgctctggcc 


ggtgaggcgg 


gcgcagtcct 


10560 


gcacgctcta 


gacgggacag 


agagcgggag 


gttagcggct 


ccgctccgtg 


gcctggggga 


10620 


cagaccgcca 


gggtgcgacg 


gcggggaacc 


ccggttcgag 


accggctgga 


tccgtccgtc 


10680 


cccgacgcgc 


cggccccgcg 


tccacgaccc 


caccagaggc 


cgagacccag 


ccgcggtgcc 


10740 


cggaccccag 


atacggaggg 


gagccttttt 


gtggtttttt 


cccgtagatg 


catccggtgt 


10800 


tgcgacagat 


gcgtccgtcg 


ccagcgccgc 


cgacgcagcc 


gccgctcccg 


ccccccacta 


10860 


gcgccgcgga 


ggctctgtcc 


ggcggccgcg 


gcgacccgga 


ggaggaggcc 


atcctcgact 


10920 


tggaagaagg 


cgagggcctg 


gcccggctgg 


gagcgccctc 


ccccgagcgc 


catccccgcg 


10980 


tgcagctggc 


gagagactcg 


cgccaggcct 


acgtgccgcc 


gcagaatctg 


ttcagggacc 


11040 


gcagcggcca 


ggagcccgag 


gagatgaggg 


accgcaggtt 


tcacgcgggg 


cgggagctgc 


11100 


gcgcgggctt 


cgaccgtcgg 


cgggtgttgc 


gcgccgaaga 


cttcgagccc 


gacgagcgca 


11160 


gcggagtaag 


tccggcacgg 


gcgcacgtgt 


cggcggccaa 


cctggtgacc 


gcgtacgagc 


11220 


agacggtgaa 


cgaggagcgg 


agctttcaga 


aaagcttcaa 


caaccacgtg 


cgcaccctga 


11280 


tcgcgcgcga 


ggaggtggcc 


atcggcctga 


tgcatctgtg 


ggactttgtg 


gaggcgtacg 


11340 


tgcagaaccc 


gtcgagcaag 


ccgctgacgg 


cgcagttgtt 


cctgatcgtg 


cagcacagtc 


11400 


gggacaacga 


gacgttccgc 


gaggcgatgc 


tgaacatcgc 


ggagcccgag 


ggccgctggc 


11460 


tcttggacct 


gattaacatc 


ctgcagagca 


tcgtggtgca 


ggagcgcagc 


ctgagcctgg 


11520 


ccgacaaggt 


ggcggccatc 


aactacagca 


tgttgagcct 


gggcaagttt 


tacgcccgca 


11580 


agatctacaa 


gagcccctac 


gtgcccatag 


acaaggaggt 


gaagatcgac 


agcttttaca 


11640 


tgcggatggc 


gctgaaagtg 


ctgacgctga 


gcgacgatct 


gggggtgtac 


cgcaacgacc 


11700 


gcatccacaa 


ggccgtgagc 


gccagccgcc 


ggcgcgagct 


gagcgaccgc 


gagctgatgc 


11760 


acagcctgcg 


gagggcgctg 


gcgggcgccg 


gcggcggcga 


ggaggccgag 


tcctacttcg 


11820 


acatgggggc 


ggacttgcag 


tggcagccca 


gcgcgcgggc 


cctggaggcg 


gcgggctacc 


11880 


gcggcggcgg 


cggcgtggtc 


gaggcggagg 


acgaggacga 


ggtggagtac 


gaggaggagg 


11940 


actgatcggc 


gaggtgtttt 


cgtagatgca 


gcgcgcgacg 


gcggcggcga 


gcgggccgca 


12000 


gggggacccc 


gccgtgctgg 


cggccctgca 


gagccaacct 


tcgggcgtga 


acgcctccga 


12060 
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13680 


gctgggtcgg 


ctgtattgag 


gaaagaaact 


aataaaagaa 
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gcttaccaga 


13740 


gccatggtcg 


cagcgtcggt 


ccctttgtgt 


gtgttttctc 


ctccccggta 


gcgaa atg 


13798 



Met 
1 



agg cgc gcg gtq gqa gtq ccg ccg gtg atg gcg tac gec gag gqt cct 13846 
Arg Arg Ala Val Gly Val Pro Pro VaT Met Ala Tyr Ala Glu Giy Pro 
5 10 15 

cct cct tct tac gaa acg gtq atg gqc gec gcg gat teg ccg gec acg 13894 
Pro pro ser Tyr Glu Thr Val Met Gly Ala Ala Asp Ser Pro Ala Thr 
20 25 30 

ctg gag gcg etc tac gtc cct ccc cgc tac ctg ggg cct acg gag ggg 13942 
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Leu Glu Ala Leu Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu Gly 

35 40 45 

agg aac age ate cgt tac tea gag ctg gcg ccg ctg tac gac ace ace 13990 

Arg Asn Ser He Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr Thr 

50 55 60 65 

cgc gtg tac ctg gtg gat aac aag teg gcg gac ate gcg teg ctg aac 14038 

Arg Val Tyr Leu val Asp Asn Lys Ser Ala Asp lie Ala Ser Leu Asn 
70 75 80 

tac cag aac gac cat age aac ttt ctg ace acg gtg gtq cag aac aat 14086 

Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn Asn 

85 90 95 

gac ttt ace ccg gtg gag gcg ggc acg cag ace ata aat ttc gac gag 

Asp Phe Thr Pro val Glu Ala Gly Thr Gin Thr lie Asn Phe Asp Glu 
100 105 110 

cgc teg egg tgg ggc ggc gac ctg aaa ace ate ctg cgc ace aac atg 14182 

Arg ser Arg Trp Gly Gly Asp Leu Lys Thr lie Leu Arg Thr Asn Met 

115 120 125 

ccc aac ate aac gag ttc atg tec ace aac aag ttc agg gee egg ttg 14230 

Pro Asn lie Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg Leu 

130 135 140 145 

atg gta gag aaa gtq aac aag gaa ace aat gee cct cga tac gag tgg 14278 

Met Val Glu Lys Val Asn Lys Glu Thr Asn Ala Pro Arg Tyr Glu Trp 
150 155 160 

ttt gag ttc ace ctg ccc gag gqc aac tac teg gag acc atg acc ata 14326 

Glu Gl\ " 



Phe Glu Phe Thr Leu Pro Glu Gly Asn Tyr Ser Glu Thr Met Thr lie 

165 170 175 

gac ctg atg aat aac gcg ate gtg gac aac tac ttg gaa gtg ggg egg 

Asp Leu Met Asn Asn Ala lie val Asp Asn Tyr Leu Glu Val Gly Arg 
180 185 190 

cag aac ggg gtg ctg gag age gac ate ggg gtg aag ttt gac acg cgc 

Gin Asn Gly Val Leu Glu Ser Asp lie Gly Val Lys Phe Asp Thr Arg 

195 200 205 



gtg tac acc aac gag gee ttc cac ccc gac ate gtc ctg ctg ccc gqc 
Val Tyr Thr Asn Glu Ala Phe His Pro Asp lie val Leu Leu Pro Gly 
230 235 240 



cgc aag egg atg ccc ttc cag gcg ggt ttt cag ate atg tac gag gac 
Arg Lys Arg Met Pro Phe Gin Ala Gly Phe Gin lie Met Tyr Glu Asp 
260 265 270 



gee age att cag aag gcg egg gag cag gqc cag gag ate cgc ggc gac 

Ala Ser lie Gin Lys Ala Arg Glu Gin Gly Gin Glu lie Arg Gly Asp 

290 295 300 305 

aac ttt acc gtc ate ccc egg gac gtg gag ate gtg ccc gtg gag aag 

Asn Phe Thr Val lie Pro Arg Asp Val Glu lie Val Pro val Glu Lys 
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14134 



14374 



14422 



aac ttc egg ctg ggc tgg gac ccg gtc acc aag ctg gtc atg ccc ggc 14470 
Asn Phe Arg Leu Gly Trp Asp Pro Val Thr Lys Leu Val Met Pro Gly 
210 " 215 220 225 



14518 



tgc ggc gtg gac ttc acg cag age egg ctg age aac ctg ctg ggg ate 14566 
Cys Gly val Asp Phe Thr Gin ser Arg Leu Ser Asn Leu Leu Gly lie 
245 250 255 



14614 



ctg gag ggc ggc aac ate ccc gee ttg eta gac gtg gcg aaa tac gag 14662 
Leu Glu Gly Gly Asn lie Pro Ala Leu Leu Asp Val Ala Lys Tyr Glu 
275 280 285 



14710 
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310 315 320 

gat age aag gac cgc agt tac aac eta etc ccc gqc gac cag acc aac 14806 

Asp Ser Lys Asp Arg Ser Tyr Asn Leu Leu Pro Gly Asp Gin Thr Asn 
325 330 335 

acg gec tac cgc age tgg ttc ctg gee tac aac tac gqc gac ccc gag 14854 

Thr Ala Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro Glu 
340 ~ 345 350 

aag gqc gtc agg tec tgg acg ctg ctg acc acc acg gac gtc acc tgc 14902 

Lys Gly val Arg Ser Trp Thr Leu Leu Thr Thr Thr Asp Val Thr Cys 

355 360 365 

gqc teg cag cag gtg tac tgg teg etc ccg gac atg atg caa gac ccc 14950 

Gly Ser Gin Gin val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp Pro 
370 375 380 385 

gtq acc ttc egg ccc tec age caa gtc age aac tac ccc gtq gtq gga 14998 

Val Thr Phe Arg Pro Ser Ser Gin Val Ser Asn Tyr Pro Val Val Gly 
390 395 400 

gtc gag etc ctg ccg gtq cac gee aag age ttt tac aac gag cag gee 15046 
Val Glu Leu Leu Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gin Ala 
405 410 415 

gtc tac teg cag etc ate cgc cag tec acc gcg etc acg cac gtc ttc 15094 
val Tyr Ser Gin Leu lie Arg Gin Ser Thr Ala Leu Thr His Val Phe 
420 425 430 

aac cgc ttc ccc gag aac cag ate ctg gtq cgc ccg ccc get ccg acc 15142 

Asn Arg phe Pro Glu Asn Gin lie Leu Val Arg Pro Pro Ala Pro Thr 

435 440 445 

att acc acc gtc agt gaa aac gtt ccc gee etc aca gat cac gga acc 15190 

lie Thr Thr Val Ser Glu Asn Val Pro Ala Leu Thr Asp His Gly Thr 
450 455 460 465 

ctg ccg ctg cgc age agt ate agt gga gtc cag cgc gtg acc ate act 15238 

Leu Pro Leu Arg Ser Ser lie Ser Gly Val Gin Arg Val Thr lie Thr 
470 475 480 

gac gee egg cga agg acc tgc ccc tac gtg cac aag gee ctg ggc ata 15286 

Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val His Lys Ala Leu Gly lie 
485 490 495 

gtc get ccc aaa gtg etc tct age cgc acc ttt taa caagcatgtc 15332 
Val Ala Pro Lys Val Leu Ser Ser Arg Thr Phe 
500 505 

cattctcatc tcgcccgaca acaacaccgg ctggggcctg cgctcggccg geatgtaegg 15392 

cggcgccaag cggcgctcca gcgagcaccc cgtccgcgtc cgcggccact accgggcccc 15452 

ctggggcgcc cacaagcgcg gcgtctccac gcgcaccacc gtcgacgacg ccatcgacgc 15512 

cgtcgtggcc caggccagac gctaccgccg gcccaagtcg acggtggacg ccgtcatcga 15572 

cagcgtggtg gccgacgcgc ggegataege tegaegcaag cggcgtctgc accgccgtcg 15632 

ccgtcccacc gccgccatgc tggccgccag ageggtcctg agacgcgcgc gccgcgtggg 15692 

ccgccgagcc atgegecgag ccgcggccaa cgccagcgcg ggtcgcgccc gtegtcagge 15752 

cgcccggcag gccgccgccg ccatcgccaa cctggcccaa ccccgccggg gaaacgtgta 15812 

ctgggtgcga gaegegtegg gcgtgcgcgt gccggtgcgc acccgccccc cteggagtta 15872 

gaagacaaaa agaeggaega agactgagtt tccctgtcgt tgccagcatg ageaagegea 15932 
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agttcaaaga 


agagctgctg 


gaggccctcg 


tgcccgagat 


ctacggcccg 


gccgccgctg 


15992 


ccgccgcggt 


ggcggacgtc 


aagcccgaag 


ttaagccccg 


cgcgctgaag 


cgggttaaaa 


16052 


agcgggaaaa 


gaaagaggag 


aagcaggaag 


cagggttgct 


agacgtcgac 


gacggcgtgg 


16112 


agttcgtgcg 


gtccttcgcg 


ccccgtcgcc 


gggtgcagtg 


gcggggtcgc 


cgcgtcaagc 


16172 


tcgtcccgcg 


gccgggcacc 


gtggtgtctt 


tcacccccgg 


cctgcgttcg 


gccacgcgcg 


16232 


gcctgaagcg 


cgagtacgac 


gaggtctatg 


gcgacgaaga 


catcctggag 


caggccgccc 


16292 


agcagctcgg 


ggagtttgct 


tacggcaagc 


gcggccgcta 


cggggaggtg 


gcgctggcgc 


16352 


tggaccaggg 


caatcccacg 


cccagcctca 


agcccgtcac 


gctgcagcag 


gtgctgcccg 


16412 


tgagcgcgtc 


gaccgagagc 


aagcggggca 


tcaagaggga 


gatgggcgac 


ctgcagccca 


16472 


ccatgcaact 


catggtgccc 


aaacggcaga 


agctggagga 


cgtgctggag 


aacatgaaag 


16532 


tggatcccag 


catcgagccc 


gaagtgaaag 


tgcgacccat 


caaggaagtg 


ggcccgggcc 


16592 


taggcgtgca 


gacggtggac 


attcagatcc 


ccgtgcgcgc 


ctcccccgtt 


tctgccacca 


16652 


ctacgacggc 


cgtggaggcc 


atggaaacgc 


agacggagct 


gcccgcggcc 


ttggcggcag 


16712 


ccgccaccgc 


cgccgcggct 


acccgagaga 


tgggcatgca 


gaccgacccc 


tggtacgagt 


16772 


tcgccggccc 


cgcccgtcgt 


ccacgagccc 


gtcggtacgc 


ggcgaccacc 


tcccggctcc 


16832 


ctgactacgt 


cttgcatcct 


tccatcacgc 


cgacgcccgg 


ctaccgcgga 


acgaccttcc 


16892 


gccccggtcg 


cgcgcgcacc 


accacccgcc 


gtcgtcgcac 


cacccgccgc 


cgtcgcagcc 


16952 


gtcgcgcact 


ggctcccatc 


gcggttcgcc 


gcgtcgtccg 


ccggggtcgc 


acgctgaccc 


17012 


tgcccaccgc 


gcgttaccac 


cccagcatcg 


tcatttaacc 


tgcgctgccg 


ttttgcagat 


17072 


ggctctgacg 


tgccgctttc 


gcttccccgt 


tcggcactac 


cgaggaagat 


ctcgccgtag 


17132 


gactggtcta 


gcgggcagcg 


gtctccgacg 


ccgccgccgc 


gcggtgcacc 


ggcgcatgaa 


17192 


gggcggcatt 


ctgcccgcgc 


tgatccccat 


tatcgccgcc 


gccatcgggg 


cgatccccgg 


17252 


cgtggcctcg 


gtggccttgc 


aagcagctcg 


caaaaattaa 


ataaagaagg 


cttgacactc 


17312 


actgcctggt 


cctgactgtt 


tcatgcagac 


aagacatgga 


agacatcaat 


tttgcgtcgt 


17372 


tggccccgcg 


gcacggctcg 


cggccgttca 


tgggcacctg 


gaacgagatc 


ggcaccagcc 


17432 


agctcaacgg 


gggcgctttc 


agttggagca 


gcctgtggag 


cggcattaaa 


aactttgggt 


17492 


ccacgattaa 


gacctatggc 


aacaaggcgt 


ggaacagtag 


cactggtcag 


atgctccgcg 


17552 


ataagctgaa 


ggaccagaac 


ttccagcaga 


aagtggtaga 


cggtctggcc 


tcgggcatca 


17612 


acggggtggt 


ggacctggcc 


aaccaggcgg 


tgcagaacca 


gatcaaccag 


cgtctggaga 


17672 


acagccgcca 


gccgcccgcg 


gccctgcagc 


agcgtccgca 


ggtggaggag 


gtggaagtgg 


17732 


aggagaagct 


gccgcccctg 


gagacggtgt 


cgccggtggg 


cgtgcctagc 


aagggggaga 


17792 


agcggccgcg 


gcccgagctc 


gaggagaccc 


tagtgaccga 


gaccctggag 


ccgccctcgt 


17852 


acgagcaggc 


cttgaaagag 


ggggccacgc 


ccctgcccat 


gacccggccc 


atcggaccca 


17912 


tggcccgacc 


ggtctacggc 


aaggaacaca 


aagccgtgac 


gctagagctg 


cctccgccgg 


17972 
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cgcccaccgt acccccgatg cccggtccca ccctgggcac cgccgtgcct cgtcccgccg 

ccccgccggt cgccgtggcc acgcccgcgc gcccgagtcg cggagccaac tggcagagca 

ctctgaacag catcgtgggc ctgggagtga aaagcctgaa acgccgccgg tgttactatt 

aaagccagct aaatacccat gtgttgtatg cgcctcctgt gtcacgccag aaaaagccag 

ccgagtgacg ggtcaccgcc gccgccaaga gcgccgcttt caag atg gcc acc ccc 

Met Ala Thr Pro 
510 

teg atg atg ccg cag tgg tct tac atg cac ate gcc ggg cag gac gcc 
Ser Met Met Pro Gin Trp Ser Tyr Met His lie Ala Gly Gin Asp Ala 



515 



520 



525 



teg gag tac ctg age ccg ggc ctg gtg cag ttc gcc cgc gcc acc gac 
Ser Glu Tyr Leu Ser Pro Gly Leu val Gin Phe Ala Arg Ala Thr Asp 



530 



535 



540 



acg tac ttc age ctg ggc aac aag ttt agg aac ccc acg gtg gcc ccc 
Thr Tyr Phe Ser Leu Gly Asn Lys Phe Arg Asn Pro Thr vat Ala Pro 



545 



550 



555 



560 



acc cac gac gtg acg acg gac egg tec cag egg ctg acg ctg egg ttc 

Thr His Asp val Thr Thr Asp Arg Ser Gin Arg Leu Thr Leu Arg Phe 

565 " 570 575 

gtg ccc gtc gac cgc gag gac acc gcg tac teg tac aaa gtg cgc ttc 

Val Pro val Asp Arg Glu Asp Thr Ala Tyr Ser Tyr Lys Val Arg Phe 

580 585 590 

acg ctg gcc gtg ggc gac aac cgc gtg ctg gac atg gcc age acg tac 

Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met Ala Ser Thr Tyr 
595 600 605 



ttt gac ate cgc ggc gtg ttg gac cgc ggt ccc age ttc aaa ccc tac 
Phe Asp lie Arg Gly val Leu Asp Arg Gl\ 

610 ~ 615 620 



Leu Asp Arg Gly Pro Ser Phe Lys Pro Tyr 



tec ggc acc gcc tac aac tec ctg gcc ccc aag ggc gcc ccc aac ccg 
ser Gly Thr Ala Tyr Asn ser Leu Ala Pro Lys Gly Ala Pro Asn Pro 
625 630 635 640 

tea gaa tgg aag ggc tea gac aac aaa att agt gta aga ggt cag get 
Ser Glu Trp Lys Gly Ser Asp Asn Lys lie Ser val Arg Gly Gin Ala 
645 650 655 

ccg ttt ttt agt aca tec att aca aag gat ggt att caa gtg gcc act 
Pro Phe Phe Ser Thr Ser lie Thr Lys Asp Gly lie Gin val Ala Thr 
660 665 670 

gat act tct age gga get gtg tat get aaa aag gaa tat cag cct gaa 
Asp Thr ser ser Gly Ala val Tyr Ala Lys Lys Glu Tyr Gin Pro Glu 



675 



680 



685 



cca caa gta ggg caa gaa caa tgg aac age gaa gcc agt gat agt gat 
Pro Gin val Gly Gin Glu Gin Trp Asn Ser Glu Ala Ser Asp Ser Asp 



690 



695 



700 



aaa gta get ggt agg att eta aaa gac aca aca ccc atg ttc cct tgt 
Lys Val Ala Gly Arg lie Leu Lys Asp Thr Thr Pro Met Phe Pro Cys 
705 710 715 720 

tac ggt tec tac gcc aag ccc aca aat gaa cag ggg ggg caa ggc act 
Tyr Gly Ser Tyr Ala Lys Pro Thr Asn Glu Gin Gly Gly Gin Gly Thr 
725 730 735 

aat act gta gat ctg cag ttc ttt gcc tct tea teg get acc tct acg 

page 114 



18032 
18092 
18152 
18212 
18268 

18316 

18364 

18412 

18460 

18508 

18556 

18604 

18652 

18700 

18748 

18796 

18844 

18892 

18940 

18988 



WO 2005/001103 PCT7US2004/016614 



Asn Thr Val Asp Leu Gin Phe Phe Ala Ser Ser Ser Ala Thr Ser Thr 
740 745 750 

cct aaa gcc gta etc tat gec gag gac gtg gca ata gaa gca cca gac 19036 

pro Lys Ala val Leu Tyr Ala Glu Asp val Ala lie Glu Ala Pro Asp 
755 760 765 

acc cat ttg gtq tac aaa ccg gca gtt aca acc acg acc act agt tec 19084 

Thr His Leu Val Tyr Lys Pro Ala val Thr Thr Thr Thr Thr Ser Ser 
770 775 780 

caa gac ctg eta act cag cag get get ccc aac cga ccc aac tac att 19132 

Gin Asp Leu Leu Thr Gin Gin Ala Ala Pro Asn Arg Pro Asn Tyr lie 

785 790 795 800 

ggc ttc agg gat aat ttt ate ggt etc atg tat tac aac tec act ggc 19180 

Gly Phe Arg Asp Asn Phe lie Gly Leu Met Tyr Tyr Asn Ser Thr Gly 
805 810 815 

aat atg ggt gtt ttg gca ggg caa get tct cag eta aac gcg gtg gtt 19228 

Asn Met Gly Val Leu Ala Gly Gin Ala ser Gin Leu Asn Ala val Val 
820 825 830 

gac ttg caa gac aga aac acc gag ctg tec tac cag etc atg ctt gat 19276 

Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin Leu Met Leu Asp 
835 840 845 

get ttg ggc gac cgc agt cgt tac ttc tec atg tgg aac cag gcc gta 19324 

Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp Asn Gin Ala Val 
850 855 860 

gac age tat gac cct gat gtc aga att att gaa aat cat ggt gtq gag 19372 

Asp Ser Tyr Asp Pro Asp Val Arg lie He Glu Asn His Gly Val Glu 

865 870 875 880 

gat gag ctg cca aac tac tgt ttc ccg eta gga ggg teg eta gta act 19420 

Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly Ser Leu Val Thr 
885 890 895 

gaa act tat aca ggc eta tea ccc caa aac gga agt aac acg tgg aca 19468 

Glu Thr Tyr Thr Gly Leu Ser Pro Gin Asn Gly Ser Asn Thr Trp Thr 
900 905 910 



acc gac age acc acc tat gca act aga ggg gtg gaa ate ggc tct ggc 
Thr Asp ser Thr Thr Tyr Ala Thr Arg Gly Val Glu lie Gly Ser Gly 
915 920 925 



19516 



aac atg ttc gcc atg gaa att aat ttg gcg gcc aat eta tgg agg agt 19564 

Asn Met Phe Ala Met Glu lie Asn Leu Ala Ala Asn Leu Trp Arg Ser 
930 935 940 

ttc ctg tac tec aac gtg gcc ctg tac ctg ccc gac gag tac aag etc 19612 

Phe Leu Tyr Ser Asn Val Ala Leu Tyr Leu Pro Asp Glu Tyr Lys Leu 

945 950 955 960 

acc ccc gac aac ate acc etc ccc gac aac aaa aac act tac gac tac 19660 

Thr Pro Asp Asn lie Thr Leu Pro Asp Asn Lys Asn Thr Tyr Asp Tyr 

965 970 975 

atg aac ggc cgc gtg gcc gcc ccc age tec etc gac acc tac gtc aac 19708 

Met Asn Gly Arg Val Ala Ala Pro Ser Ser Leu Asp Thr Tyr Val Asn 
980 985 990 

ate geig gcg cgc tgg tec ccc gac ccc atg gac aac gtc aac ccc ttc 19756 



He Gly Ala Arg Trp Ser Pro Asp Pro Met Asp Asn Val Asn Pro Phe 
995 1000 1005 

aac cac cac cgc aac gcg gga ctg cgc tac cgc tec atg ctg ctg 
Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg Ser Met Leu Leu 
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1010 



1015 



gqc aac gqc cgc tac gta ccc 
Gly Asn Gly Arg Tyr Val Pro 



ttc cac ate caa 
Phe His lie Gin 



1025 



1030 



ttc ttc gec ate aaa aac etc 
Phe Phe Ala lie Lys Asn Leu 
1040 1045 

tac gag tgg aac ttc cgc aag 
Tyr Glu Trp Asn Phe Arg Lys 
1055 1060 



age ctg 

Ser Leu Gly Asn Asp Leu Arg 
1070 1075 



gqt aac gac etc cgc 
5l\ 



gac age ate aac ctg tac gec 
Asp Ser lie Asn Leu Tyr Ala 
1085 1090 

ace gee tec acg etc gag gec 
Thr Ala Ser Thr Leu Glu Ala 
1100 1105 

cag teg ttc aac gac tac etc 
Gin ser Phe Asn Asp Tyr Leu 
1115 1120 

ate ccc gee aac gee ace age 
lie Pro Ala Asn Ala Thr Ser 
1130 1135 



ctg etc etc ccc 
Leu Leu Leu Pro 



gac gtc aac atg 
Asp val Asn Met 



gtc gac gqg gec 
val Asp Gly Ala 



aac ttc ttc ccc 
Asn Phe Phe Pro 



atg ctg cgc aac 
Met Leu Arg Asn 



tgc get gee aac 
Cys Ala Ala Asn 



1020 

gtq ccc cag aaa 19846 

Val Pro Gin Lys 

1035 

gqg tec tac ace 19891 

Gly Ser Tyr Thr 

1050 

ate etc cag age 19936 

lie Leu Gin Ser 

1065 

age gtc agg ttc 19981 

Ser val Arg Phe 

1080 

atg gec cac aac 20026 

Met Ala His Asn 

1095 

gac ace aac gac 20071 

Asp Thr Asn Asp 

1110 

atg etc tac ccc 20116 

Met Leu Tyr Pro 

1125 



gtq ccc ate tec 
val Pro lie Ser 



att ccc teg egg 
lie Pro Ser Arg 
1140 



20161 



aac tgg gee gec ttc egg gqc 
Asn Trp Ala Ala Phe Arg Gly 
1145 1150 



tgg age ttc ace 
Trp Ser Phe Thr 



egg etc aag acc 
Arg Leu Lys Thr 
1155 



20206 



aag gag acc ccc tct ctg gqc 
Lys Glu Thr Pro ser Leu Gly 



1160 



1165 



tec gqc ttc gat 
Ser Gly Phe Asp 



ccc tac ttc acc 
Pro Tyr Phe Thr 
1170 



20251 



tac teg gqc tec ate ccc tac 
Tyr Ser Gly Ser He Pro Tyr 



1175 



1180 



ctg gac gqc acc 
Leu Asp Gly Thr 



cac act ttc aag aag gtc tec 
His Thr Phe Lys Lys Val Ser 
1190 1195 



ate atg ttc gac 
lie Met Phe Asp 



ttc tac etc aac 20296 

Phe Tyr Leu Asn 

1185 

tec tec gtc age 20341 

Ser ser Val Ser 

1200 



tgg ccc gqc aac gac cgc ctg 
Trp Pro Gly Asn Asp Arg Leu 
1205 1210 



ctg acc ccc aac 
Leu Thr Pro Asn 



gag ttc gag ate 
Glu Phe Glu lie 
1215 



20386 



aag cgc 
Lys Arg 
1220 



acc gtq gac gqg gaa 
Thr Val Asp Gly Glu 
1225 



gqg tac aac gtq 
Gly Tyr Asn val 



gee cag tgc aac 
Ala Gin cys Asn 
1230 



20431 



atg acc aag gac tgg ttc etc 

Met Thr Lys Asp Trp Phe Leu 
1235 1240 

ate gqc tac cag gqc ttc tac 

lie Gly Tyr Gin Gly Phe Tyr 
1250 1255 



ate cag atg etc 
lie Gin Met Leu 



gtq ccc gag gqc 
val Pro Glu Gly 



age cac tac aac 20476 

Ser His Tyr Asn 

1245 

tac aag gac agg 20521 

Tyr Lys Asp Arg 

1260 



atg tac tct ttc ttc cgc aac 
Met Tyr Ser Phe Phe Arg Asn 
1265 1270 



ttc caa ccc atg 
Phe Gin Pro Met 
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gtc gac acc acc acc tac acc gac tac aaa aac gtc acc etc ccc 20611 

Val Asp Thr Thr Thr Tyr Thr Asp Tyr Lys Asn Val Thr Leu Pro 

1280 1285 1290 

ttc cag cac aac aac teg gqg ttc gtq gqa tac atg ggc ccc acc 20656 

Phe Gin His Asn Asn Ser Gly Phe val Gly Tyr Met Gly Pro Thr 

1295 1300 1305 

atg cgc gag gqg cag gec tac ccc gec aac tac ccc tac ccc ctg 20701 

Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu 

1310 1315 1320 

ate ggc aag acc gee gtq ccc age etc acg cag aaa aag ttc etc 20746 

lie Gly Lys Thr Ala Val Pro Ser Leu Thr Gin Lys Lys Phe Leu 

1325 1330 1335 

tgc gac cgc acc atg tgg cgc ate ccc ttc tec agt aac ttc atg 20791 

cys Asp Arg Thr Met Trp Arg lie Pro Phe Ser Ser Asn Phe Met 

1340 1345 1350 

tec atg gqg gcg etc acc gac ctg gqg cag aac atg ctg tac gee 20836 

Ser Met Gly Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala 

1355 1360 1365 

aac tec gee cac gee etc gac atg acc ttc gag gtq gac ccc atg 20881 

Asn Ser Ala His Ala Leu Asp Met Thr Phe Glu val Asp Pro Met 

1370 1375 1380 

gat gag ccc acg ctt etc tat gtt ctg ttc gaa gtq ttc gac gtc 20926 

Asp Glu Pro Thr Leu Leu Tyr Val Leu Phe Glu val Phe Asp Val 

1385 1390 1395 

gtq cgc ate cac cag ccg cac cgc ggc gtc ate gag gec gtc tac 20971 

val Arg lie His Gin Pro His Arg Gly Val lie Glu Ala val Tyr 

1400 1405 1410 



ctg cgc acg ccg ttc teg gec gqt aac gec acc acc taa ggagggggee 21020 
Leu Arg Thr Pro Phe Ser Ala Gly Asn Ala Thr Thr 



1415 




1420 




1425 






gecgaeggat 


gggctccagc 


gagceggage 


tggtcgecat 


cgcgcgcgac 


ctgggctgcg 


21080 


ggccctactt 


cctgggcacc 


tttgacaaac 


gcttcccggg 


cttcgtggcg 


ccgcacaagc 


21140 


tggcctgcgc 


catcgtcaac 


accgccggac 


gegagacegg 


cggcgtccac 


tggctggccc 


21200 


tggcctggaa 


cccccgcagc 


cgaacctgct 


acctcttcga 


ccccttcggc 


ttctcggacg 


21260 


acaggctcag 


gcagatctac 


cagttcgagt 


aegaaggect 


gctccggcgc 


agcgccctcg 


21320 


cctccacccc 


cgaccactgc 


gtcaccctcg 


tcaagtccac 


ccagaccgtc 


caggggcccc 


21380 


gctcggccgc 


ctgcggcctc 


ttctgetgea 


tgttcctgca 


cgccttcgtg 


cgctggcccg 


21440 


cctcccccat 


ggaeggcaac 


cccaccatgg 


acctccttac 


gggcgttccc 


aacagcatgc 


21500 


ttcagagtcc 


ccaggtcgag 


cccaccctcc 


accgcaacca 


ggaggaactc 


tacgccttcc 


21560 


tggcteggea 


ctccccctac 


tttcgccgcc 


accgcgagcg 


catagaaaag 


gccaccgcgt 


21620 


ttgacaaaat 


gaacgactag 


attttctgtg 


aaaaacactc 


aataaagect 


ttattggttc 


21680 


accacacgtg 


cacgcatgca 


gactttttat 


ttaaaagggc 


tccgcctcct 


cgtcgccgtg 


21740 


gctggtgggg 


agggagacgt 


tgcgatactg 


caggegggag 


ctccatctga 


actegggaat 


21800 


cagcagcttg 


ggcagggggc 


cctcgacgtt 


ctcgctccac 


agettgegea 


ccagctgcag 


21860 
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nroanrrraa 

yLyayLLLaa 


LLLLLy LL LL 


a a rttrtarr 

ddL L LL LdLL 


LLy LLLL LyL. 


yy Ly LLLydy 


73fiOO 


y^yL LyyL.ua 


r r t" a 1* r a c a 1* 

ll La LLQLa L 


rttfttraan 
l l ll l LLaay 


a a rranrnra 
ddLLdy Ly Ld 


1r r c c c clr ct c 
LLLLLL. Ll Ll 


y LyLLy LyLL 


73fifiO 




y Ly LLy aLLy 


c etc c t*o ra t" 

LL LLL Ly La L 


cTccr\7\t\ffri 
l LLLy dy LLy 


nrnrrrir*ra"t" 

y LyLLy LLd L 


arrtnanatr 
dLL Lydyd LL 


73770 


nrrtrrrtnn 
yLL llll iyy 


annaafitTTP 
ay y aay llll 


raaoatrttr 

Laay a ll l ll 


yddyy ll LLy 


nr a a nna rna 
y Lddy ydL.yd 


yddycgcgLg 




gcaaacgctc 


tggaaaagaa 


egagagegag 


ggtcagaacg 


tgctggtcga 


gctggaaggc 


23840 


gacaacgcgc 


gtctggccgt 


gctcaaacgc 


accatcgaag 


tctcccactt 


cgcctacccc 


23900 


gcgctcaacc 


ttccccccaa 


ggtcatgege 


teggtcatgg 


ateagctget 


catcaagcgc 


23960 
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gccgagcccc 


tcgagaacga 


ctccgaggtg 


gattccgagg 


acggaaaacc 


cgtggtctcg 


24020 


gacgaggagc 


tcgcgcgctg 


gctgggcacg 


caggaccccg 


ccgagttgca 


agagcggcgc 


24080 


aagatgatga 


tggcggccgt 


gctggtcacc 


gccgagctcg 


agtgcctgca 


gcgcttcttc 


24140 


gccgaccccc 


agaccctgcg 


caaggtcgag 


gagtccctgc 


actacgcctt 


ccgccacggc 


24200 


tacgtgcgcc 


aggcctgcaa 


gatctccaac 


gtggagctta 


gcaacctggt 


ctcctacatg 


24260 


ggcatcctgc 


acgagaaccg 


cctcgggcag 


aacgtcctcc 


actgcaccct 


gaccggggag 


24320 


gcccgccgcg 


actacgtccg 


cgactgcatc 


tacctctttc 


tcaccctcac 


ctggcagacc 


24380 


gccatggggg 


tctggcagca 


gtgtctggag 


gagcgcaacc 


ttcgcgagct 


cgacaagcta 


24440 


ctgagccgcg 


agcgccgcga 


gctctggacg 


gctttcagcg 


agcgcaccgc 


cgcctgccgt 


24500 


ctggccgacc 


tcatcttccc 


cgagcgactc 


aggcaaaccc 


tccagaacgg 


cctgcccgac 


24560 


tttgtcagcc 


agagcatgct 


gcaaaacttt 


cgctccttca 


tcctggagcg 


atccggcatc 


24620 


ttgcccgcca 


tgagctgcgc 


cctgccctcc 


gatttcgtcc 


ccctctatta 


tcgcgagtgc 


24680 


cccccgccgc 


tctggagcca 


ctgctacctg 


ctgcgtctgg 


ccaactacct 


cgcccaccac 


24740 


tccgacctca 


tggaagactc 


cagcggcgag 


gggctgctgg 


agtgccactg 


ccgctgcaac 


24800 


ctctgcaccc 


cccaccgctc 


gctggtctgc 


aacaccgagc 


tgctcagcga 


gacgcaagtg 


24860 


atcggtacct 


ttgagatcca 


gggaccagag 


gggccggagg 


gtgcttccaa 


cctcaagctc 


24920 


agcccggcgc 


tctggacttc 


cgcctacctg 


cgcaaattta 


tccccgagga 


ctatcacgcc 


24980 


caccagatcc 


aattctacga 


agaccaatcg 


cgacccccca 


aagcccccct 


cacggcctgt 


25040 


gtcatcaccc 


agagccagat 


tctggcccaa 


ttgcaagcca 


tccagcaggc 


ccgccaagag 


25100 


ttcctcctga 


aaaagggtca 


cggggtctat 


ctggaccccc 


agaccggcga 


ggaactcaac 


25160 


accccgtcac 


cctccgccgc 


cgcttcgtgc 


cgcccgcaga 


accatgccgc 


ccaaagggaa 


25220 


caagcaggcc 


atcgcccagc 


ggcgggccaa 


gaagcagcaa 


gagctccagg 


agcagtggga 


25280 


cgaggagtcc 


tgggacagcc 


aggcggagga 


agtctcagac 


gaggaggagg 


acatggagag 


25340 


ctgggacagc 


ctagacgagg 


aggaggaggc 


cgaggagcta 


gaggacgagc 


ctctcgagga 


25400 


ggaagagccc 


agcagcgccg 


cggcaccatc 


ggcttccaaa 


gaagcggctc 


ggagccggcc 


25460 


ggccccgaag 


cagcagaagc 


agcaacagcc 


gccaccgtcg 


cccccgacgc 


caccaccagg 


25520 


ctcactcaaa 


gccagccgta 


ggtgggacgc 


ggtgtccatc 


gcgggatcgc 


ccaaagcccc 


25580 


agtcggtaag 


ccacccgggc 


ggtcgcggcg 


ggggtactgt 


tcctggcgcc 


cccacaagag 


25640 


caagatcgtc 


gcctgcctcc 


agcactgccg 


gggcaacatc 


tccttcgcgc 


ggcgctactt 


25700 


gctcttccac 


gacggggtgg 


cggtgccgcg 


caacgtcctc 


tactattacc 


gtcatctcta 


25760 


cagcccctac 


gagacagaag 


gcccggcctc 


cgcgtaagac 


cagccgccag 


acggtctcct 


25820 


ccgccatcgc 


gacccgccag 


gactcggccg 


ccacgcagga 


gctcagaaaa 


cgcatctttc 


25880 


ccaccctgta 


tgctatcttc 


cagcagagcc 


gcggccagca 


gctggaactg 


aaagtaaaaa 


25940 


accgctccct 


gcgttcgctc 


acccgcagct 


gtctgtacca 


caggagggaa 


gaccaactgc 


26000 
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agcgcacgct 


cqaqqacqcc 


gaggcactgt 


tcaataaata 


ctgctcggtg tctcttaagg 


26060 


actgaaagcc 


cgcgcttttt 


cagaggctca 


ttaegtcate 


atcatcatga gcaaggacat 


26120 


tcccacgcct 


tacatgtgga 


gctaccagcc 


qcaqatqqqa 

7*"** 7** w 77 7** 


ctggcggccg gcgcctccca 


26180 


ggattactcc 


agtcgcatga 


actggctgag 


tgccggcccc 


cacatgatcg ggcgggtcaa 


26240 


tgggattcgt 


gccacccgca 


atcagatact 


gctggaacag 


gccgccctca cctccacccc 


26300 


gcgacgtcag 

777 7 


ctgaacccgc 


ccgcttggcc 


cgccgcccag 


gtgtaccagg aaaaccccgc 


26360 


cccgaccaca 


gtcctcctgc 


cacgcgacgc 


ggaggccgaa 
7777 ^ 3^** 


gtccagatga ctaactccgg 


26420 


ggcgcaatta 


gcgggcggcg 

7773777 


cccgccacgt 


cgtcgctccc 


qqqtacaqaq qtcqqcccqc 


26480 


accctacccc 


tccggcccta 


taaagaggct 


gatcattcga 


ggccgaggta tccagctcaa 


26540 


cgacgaggtg 


gtgagctcct 


cgaccggtct 


tcggcccgac 


qqaqtcttcc aacttaaaaa 


26600 


cgccggccgc 

7 7 7 7 


tcttccttca 


ccactcgcca 


ggcctacctg 


acgctccaga gctcttcctc 


26660 


ccagcctcgc 


tccqqcqqca 

7777 


tcggcaccct 


ccagttcgtg 


qaqqaqttcq tqccctcaat 


26720 


ctacttcaac 


ccgttctccg 


gctctcccgg 


ccgctacccg 


gacagcttca tccccaacta 


26780 


cgacgcggtg 


agcgaatccg 


tggacqgeta 

77 77 ^* 


cqattqatqa 


ccgatggtgc ggcegtaact 


26840 


gcgcggcggc 

7 7 77 77 


aacatctgca 


tcactgccat 


cgtcctcqqt 


qcttcqcccq qqaqqectat 

3 7 3 7 7 77 7 


26900 


gagttcatct 


acttccagct 


cgccccggac 


cagcttcagg 


gcccttcgca cggcgttaag 


26960 


ctcgtgatag 


aggaagagct 


cqaqaqtaqc 


tqcctqcqct 

w -*m ^» 7 7 


ottttacctc acaccccatc 


27020 


t. Lag Lcyciyci 


gggaacgegg 


taggaccacc 


crcaccctic l 


actgeatctg tgactccccg 


Z7080 


gaattacatg 


aagatctgtg 


ttgecttcta 


tgtgccgaac 


aataacccct cttgtaacta 


27140 


cctacatcca 


caataaacca 


gaatttggaa 


actcctttcg 


tttgtttgca g atg aaa 


27197 



Met Lys 



cgc gec cgc etc gac gac gac ttc aac ccc gtc tac ccc tat gac 27242 
Arg Ala Arg Leu Asp Asp Asp Phe Asn Pro val Tyr Pro Tyr Asp 
1430 1435 1440 

act ccc aac get ccc tct gtt ccc ttc ate act cct ccc ttc gtc 27287 
Thr Pro Asn Ala Pro Ser Val Pro Phe lie Thr Pro Pro Phe Val 
1445 1450 1455 

tec teg gac gqc ttg caa gaa aaa cca ccc gga atg etc agt etc 27332 
Ser Ser Asp Gly Leu Gin Glu Lys Pro Pro Gly Met Leu Ser Leu 
1460 1465 1470 

aac tac caa gat cct att ace ace caa aac gqg gca tta act eta 27377 
Asn Tyr Gin Asp Pro lie Thr Thr Gin Asn Gly Ala Leu Thr Leu 
1475 1480 1485 

aag ctt gqc age gga ctg aac ata aac caa gat gqg gaa ctt ace 27422 
Lys Leu Gly ser Gly Leu Asn lie Asn Gin Asp Gly Glu Leu Thr 
1490 1495 1500 

tea gac gee age gtt etc gtc act ccc ccc att aca aaa gee aac 27467 
Ser Asp Ala Ser Val Leu Val Thr Pro Pro lie Thr Lys Ala Asn 
1505 1510 1515 

aac aca ata gqc eta gee ttc aat gca cct ctt ace ttg caa age 27512 
Asn Thr lie Gly Leu Ala Phe Asn Ala Pro Leu Thr Leu Gin ser 
1520 1525 1530 
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gat act tta aat ctt get tgt aac gcc cca ctt acc gtg caa gac 27557 
Asp Thr Leu Asn Leu Ala Cys Asn Ala Pro Leu Thr Val Gin Asp 
1535 1540 1545 

aat agg ttg gqa ata aca tac aac tct ccc etc acc ttg caa aac 27602 
Asn Arg Leu Gly lie Thr Tyr Asn ser Pro Leu Thr Leu Gin Asn 
1550 1555 1560 

age gaa ctt gcc eta gcg gtc acc ccg cct ctt gac act gcc aat 27647 
Ser Glu Leu Ala Leu Ala Val Thr Pro Pro Leu Asp Thr Ala Asn 
1565 1570 1575 

aac aca ctt gcg ctt aaa acc gcc egg cct ata att aca aac tct 27692 
Asn Thr Leu Ala Leu Lys Thr Ala Arg Pro lie lie Thr Asn Ser 
1580 1585 ~ 1590 

aat aac gag ctt aca etc tec get gat get ccc eta aac acc age 27737 
Asn Asn Glu Leu Thr Leu Ser Ala Asp Ala Pro Leu Asn Thr Ser 
1595 1600 1605 

acg ggt acc etc cgc eta caa age gca gca cca ctg ggg eta gtt 27782 
Thr Gly Thr Leu Arg Leu Gin Ser Ala Ala Pro Leu Gly Leu Val 
1610 1615 1620 

gac caa acc ctg cga gtg ctt ttt tct aac cca etc tac ttg caa 27827 
Asp Gin Thr Leu Arg Val Leu Phe Ser Asn Pro Leu Tyr Leu Gin 
1625 1630 1635 

aac aac ttt etc tea eta gcc att gaa cgc cca ttg get tta act 27872 
Asn Asn Phe Leu ser Leu Ala lie Glu Arg Pro Leu Ala Leu Thr 
1640 1645 1650 



acc act ggt tct atg get atg cag att tec caa cca tta aaa gtg 
Thr Thr Gly ser Met Ala Met Gin lie Ser Gin Pro Leu Lys Val 
1655 1660 1665 



27917 



gaa gac gqa age tta age ttg age att gaa age cct eta aat eta 27962 
Glu Asp Gly ser Leu Ser Leu Ser lie Glu Ser Pro Leu Asn Leu 
1670 1675 1680 

aaa aac gqa aat ctt act tta gga acc caa agt ccc eta act gtc 28007 
Lys Asn Gly Asn Leu Thr Leu Gly Thr Gin Ser Pro Leu Thr Val 
1685 1690 1695 

act ggt aac aac etc age ctt aca aca aca gcc cca tta acg gtt 28052 
Thr Gly Asn Asn Leu Ser Leu Thr Thr Thr Ala Pro Leu Thr Val 
1700 1705 1710 

cag aac aac get eta gcc etc tea gtg tta ctg ccg ctt aga eta 28097 
Gin Asn Asn Ala Leu Ala Leu Ser Val Leu Leu Pro Leu Arg Leu 
1715 1720 1725 

ttt aat aac acc tea ctg gga gtg gca ttc aac cca ccc att tct 
Phe Asn Asn Thr Ser Leu Gly Val Ala Phe Asn Pro pro lie Ser 
1730 1735 1740 

tea gca aac aac ggg ctg tct ctt gac att gga aat ggc ctt aca 
Ser Ala Asn Asn Gly Leu Ser Leu Asp lie Gly Asn Gly Leu Thr 
1745 1750 1755 

ctg caa tac aac agg etc gta gtg aac att ggc ggc ggg eta cag 
Leu Gin Tyr Asn Arg Leu Val Val Asn lie Gly Gly Gly Leu Gin 
1760 1765 1770 

ttt aac aac ggt get att acc get tec ata aat gca get ctg ccg 28277 
Phe Asn Asn Gly Ala lie Thr Ala ser lie Asn Ala Ala Leu Pro 
1775 1780 1785 
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ttg cag tat tec aat aac cag ctt tct ctt aat att gga ggc ggg 
Leu Gin Tyr Ser Asn Asn Gin Leu ser Leu Asn lie Gly Gly Gly 



1790 



1795 



1800 



ctg cga tac aac ggc act tac aaa aat tta gec gtc aaa acc gac 

Leu Arg Tyr Asn Gly Thr Tyr Lys Asn Leu Ala Val Lys Thr Asp 

1805 1810 1815 

tct ttt agg ggt ctt gaa att gac agt aat cag ttc ctg gtg cca 

Ser Phe Arg Gly Leu Glu lie Asp Ser Asn Gin Phe Leu val Pro 

1820 1825 1830 



aga ctg ggt tct ggt eta aag ttt gat caa tat ggg tac att age 
Arg Leu Gly Ser Gly Leu Lys Phe Asp Gin Tyr Gly Tyr lie Ser 
1835 1840 1845 



gtc ata cct cca act gtt acg cca aca aca ctt tgg act aca gca 
val lie Pro Pro Thr val Thr Pro Thr Thr Leu Trp Thr Thr Ala 
1850 1855 1860 

gac cct tct ccc aac get act ttt tac gac age tta gat get aag 
Asp Pro Ser Pro Asn Ala Thr Phe Tyr Asp Ser Leu Asp Ala Lys 
1865 1870 1875 

gta tgg ctg gee tta gta aaa tgc aac ggc atg gtt aat gga acc 
val Trp Leu Ala Leu val Lys Cys Asn Gly Met Val Asn Gly Thr 
1880 1885 " 1890 

ata gec ata aag get tta aaa ggt act ctg etc caa cct acg get 
lie Ala lie Lys Ala Leu Lys Gly Thr Leu Leu Gin Pro Thr Ala 
1895 1900 1905 

agt ttt att tct ttt gtt atg tat ttt tac age aat ggc acc aga 
Ser Phe lie ser Phe Val Met Tyr Phe Tyr Ser Asn Gly Thr Arg 



1910 



1915 



1920 



aga act aac tac ccc acg ttt gaa aat gaa ggc ata eta get agt 
Arg Thr Asn Tyr Pro Thr Phe Glu Asn Glu Gly lie Leu Ala Ser 



1925 



1930 



1935 



agt get aca tgg ggt tat cgt caa gga aac teg gca aac acc aac 

Ser Ala Thr Trp Gly Tyr Arg Gin Gly Asn ser Ala Asn Thr Asn 
1940 1945 1950 

gtc acc agt gec gtt gaa ttt atg cct age tec aca aga tat cct 

Val Thr Ser Ala Val Glu Phe Met Pro Ser Ser Thr Arg Tyr Pro 
1955 1960 1965 

gtt aac aag ggt act gag gtt cag aac atg gaa etc acc tac act 

val Asn Lys Gly Thr Glu Val Gin Asn Met Glu Leu Thr Tyr Thr 



1970 



1975 



1980 



ttc ttg cag gga gac ccc act atg gee ata tea ttt caa get att 
Phe Leu Gin Gly Asp Pro Thr Met Ala lie Ser Phe Gin Ala lie 
1985 1990 1995 

tat aac cat get ttg gaa ggt tac tct tta aaa ttt acc tgg cga 
Tyr Asn His Ala Leu Glu Gly Tyr Ser Leu Lys Phe Thr Trp Arg 

2005 



2000 



2010 



gtt cgc aac agg gaa cgc ttt gat ate ccc tgc tgt tct ttt tct 
Val Arg Asn Arg Glu Arg Phe Asp He Pro Cys Cys Ser Phe Ser 
2015 2020 2025 

tac ata acg gaa gaa taa acactgtttt tcttttcaat gtttttattc 
Tyr He Thr Glu Glu 
2030 

tgctttttta cacagttcga accgtcagac tccctccccc cttccacttc acccggtaca 
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cctcccgctc cccctggatc gctgcgtaca actgcagttt ggtgttcaga cacgggttct 29165 

taggtgacag tatccacacg gcctctttgc cggccaggcg ctggtccgta atgctcacaa 29225 

atccctccga cacgtcctcc agacacacgg tggaatccaa ggcgcccgtc tacaaaacaa 29285 

acacagtcat gctctccacg ggttctctcc tcggtcgtac tgcgccagcg tgaacgggcg 29345 

atggtgctcc atcagggctc gcagcaaccg ctgtcggcgc ggctcaccca ggctccggcg 29405 

aaaagcgccc cgtctgggag tgctattcaa aaaacgcacc gcctttatca acagtctcct 29465 

cgtgcggcgg gcgcagcagc gcacctggat ctctgtcagg tctttacaat aggtacagcc 29525 

catcaccacc atgttgttta aaatcccaaa gctaaacacg ctccacccaa atgacatgaa 29585 

ttccagcacc gccgcggcgt ggccatcata caatatgcgg aggtaaatca ggtgccgccc 29645 

cctaatacaa acgctcccca tatacatcac ctccttaggc agttgataat taaccacctc 29705 

ccggtaccag ggaaacctca cgtttactaa agccccaaac accaacattt taaaccagtt 29765 

agccagcacc acccctcccg ccttacactg cagcgacccc ggctgtttac aatgacagtg 29825 

aatcacccac ctctcatacc ccctaatgac ctggcgtggc tccacatcta tagtagcaca 29885 

gcacacgcac accctcatgt aatgcttcat cacaaatctt tcccaagggg ttagtatcat 29945 

gtcccagggt acgggccact cctgcagcac ggtgaaaggt acgcaggcgg gaacagtcct 30005 

cacctcggac acataatgca tattcagatg ttcacactct aaaaccccgg ggcttccctc 30065 

caacgcagcc actggcaagt tctcagaggg tggtgtaagg cggtggtgct gatagggact 30125 

caatctgtgt cgacaccgtc tgtcgcgttg catcgtagac caacgcttgg cgcaccgcct 30185 

cgtacttcgc ccaaagaaaa cgggtgcgac gccaacacac ttccgcgtac cgtgggttcc 30245 

gcactcgagc tcgctcagtt ctcaacgcat aatgcagcca ttcctgtaat ccacacaaca 30305 

gtcgctcggc ttccaaagag atgtgcacct cgtatcttat aacgtcccga tatatatcca 30365 

agcaggcagt cagggccact tgcaaccagt gcacgcaggc ggactgatcg cgacacactg 30425 

gaggtggagg gagagacgga agaggcatgt tactccagac ggtcgaaaag cggatcaaag 30485 

tgcagatcgc gaagatggca gcgatccccg ccgctacgct ggtgatagat cacagccagg 30545 

tcaaacataa tgcggttttc caaatgacct attaccgcct ccaccagagc cgccacgcgc 30605 

acttccagaa acaccagcac ggctacggca ttctcctcaa aatcttcaaa cattaagctg 30665 

catgattgaa tcacccccaa ataattctcc tccttccatt ctcgcaaaat ttgagtaaaa 30725 

acctctcgca gattagctcc gtggcgttca aaaaggtcac ttagagcgcc ctccaccgcc 30785 

atgcgcaagc acaccctcat gattgaaaaa tgccagtctc ctgaaccacc tgcagttgat 30845 

ttaaaagacc tatattagga tcaattccac tctcccgcag ctccacgcgt agcattagct 30905 

gcaaaaagtc atttaaatct tcgcaaacta gcgcggtaag ctcgccgccg ggaattaggt 30965 

ctgaagcagt caccacacac ataatttcca gtgaaggagt cagtctaagc agcaaaaagc 31025 

cgcatgagca gtgttgaaaa ggaggggtca cgcaatgtaa catatgcagc caaaaatctc 31085 

caaggtgtct gtgcataaac tccaccactg aaaagtccaa atcatgtaaa tatgccatca 31145 
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ccacctcaaa 

\— v»y %•»*>. w*_u.*4*4 


aaccaccaca 


gacacaaaaa 


caaaccataq 


caaatacatg gtgtcctgca 


31205 


aagcaaaaac 


acatttatac 


cataqaqqcq 


cgaattactt 


ggggaaaaat cactcgctcc 


31265 


aaaactaaac 


aqqccaccqt 


ctgaccgcgc 


cagccataaa 


aaaagcggtt cgaatgatta 


31325 


aaaagaataa 


tagacacctc 


ccaccaggta 


ctcggctgca 


actcgtgcgc ccctatcaaa 


31385 


accccgcgga 


cgttcatgtc 


ggccatagaa 


aaaatgcggc 


ccaaatatcc caccggaatc 


31445 


tccacggcca 


qctqcaqtqa 


tagcaaaaga 


acgccatgag 


gagcaatcac aaaattttca 


31505 


aacaataaaa 

g3 v y i4 *• ******** 


gcacataaag 


gttagaatag 


ccctgctgca 


caggtaataa agcccgcgag 


31565 


ctcagcaaat 


gcacataaac 


cgcttcagcc 


atcccgtctt 


accgcgaaca aaaggctcac 


31625 


agtacacagt 


tactcaaccc 


acacgccaca 


cagtatttat 


acactcctca atcgccacgt 


31685 


cacccgcccc 


gaacaaactc 


caaaagtcca 


aaaagtccaa 


aacgcccgcg taaaagcccg 


31745 


ccaaaacagc 


acttcctcat 


ttactctccc 


acagtacgtc 


acttccgccg cgcccgccgc 


31805 


cctcgccccg 


ccctcaccct 


cgcgctccac 


cccgcgcccc 


acgtcagact cccaccccgc 


3l5DD 


cccgcgcccg 


cgtcatccgc 


accccaccct 


cactccaccc 


ctaaccccgc ctcctcatta 


31925 


tcatattggc 


accgttccca 


aataaggtat 


attatgatga 


tg 


31967 



<210> 13 
<211> 508 
<212> PRT 

<213> Simian adenovirus 
<400> 13 

Met Arg Arg Ala Val Gly Val Pro Pro val Met Ala Tyr Ala Glu Gly 
15 10 15 

Pro pro Pro ser Tyr Glu Thr Val Met Gly Ala Ala Asp Ser Pro Ala 
20 25 30 

Thr Leu Glu Ala Leu Tyr Val Pro Pro Arg Tyr Leu Gly Pro Thr Glu 
35 40 45 

Gly Arg Asn ser lie Arg Tyr Ser Glu Leu Ala Pro Leu Tyr Asp Thr 
50 55 60 

Thr Arg Val Tyr Leu Val Asp Asn Lys ser Ala Asp lie Ala Ser Leu 
65 ~ 70 75 80 

Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val Val Gin Asn 
85 90 95 

Asn Asp Phe Thr Pro Val Glu Ala Gly Thr Gin Thr lie Asn Phe Asp 
100 105 110 

Glu Arg Ser Arg Trp Gly Gly Asp Leu Lys Thr lie Leu Arg Thr Asn 
115 120 125 

Met Pro Asn lie Asn Glu Phe Met Ser Thr Asn Lys Phe Arg Ala Arg 
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130 135 140 

Leu Met val Glu Lys Val Asn Lys Glu Thr Asn Ala Pro Arg Tyr Glu 
145 150 155 160 

Trp Phe Glu Phe Thr Leu Pro Glu Gly Asn Tyr ser Glu Thr Met Thr 
165 170 175 

lie Asp Leu Met Asn Asn Ala lie Val Asp Asn Tyr Leu Glu Val Gly 
180 185 190 

Arg Gin Asn Gly val Leu Glu Ser Asp lie Gly val Lys Phe Asp Thr 
195 200 205 

Arg Asn Phe Arg Leu Gly Trp Asp Pro val Thr Lys Leu val Met Pro 
210 215 220 

Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp lie Val Leu Leu Pro 
225 230 235 240 

Gly cys Gly val Asp Phe Thr Gin ser Arg Leu Ser Asn Leu Leu Gly 
245 250 255 

lie Arg Lys Arg Met Pro Phe Gin Ala Gly Phe Gin lie Met Tyr Glu 
260 265 270 

Asp Leu Glu Gly Gly Asn lie Pro Ala Leu Leu Asp Val Ala Lys Tyr 
275 280 285 

Glu Ala ser lie Gin Lys Ala Arg Glu Gin Gly Gin Glu He Arg Gly 
290 295 300 

Asp Asn Phe Thr val lie Pro Arg Asp Val Glu lie Val Pro Val Glu 
305 310 315 320 

Lys Asp Ser Lys Asp Arg Ser Tyr Asn Leu Leu Pro Gly Asp Gin Thr 
325 330 335 

Asn Thr Ala Tyr Arg Ser Trp Phe Leu Ala Tyr Asn Tyr Gly Asp Pro 
340 ~ 345 350 

Glu Lys Gly val Arg ser Trp Thr Leu Leu Thr Thr Thr Asp Val Thr 
355 ~ 360 365 

Cys Gly Ser Gin Gin val Tyr Trp Ser Leu Pro Asp Met Met Gin Asp 
370 375 380 

Pro val Thr Phe Arg Pro Ser Ser Gin Val Ser Asn Tyr Pro Val Val 
385 390 395 400 

Gly val Glu Leu Leu Pro Val His Ala Lys Ser Phe Tyr Asn Glu Gin 
405 410 415 
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Ala val Tyr Ser Gin Leu lie Arg Gin Ser Thr Ala Leu Thr His Val 
420 425 430 



Phe Asn Arg Phe Pro Glu Asn Gin lie Leu Val Arg Pro Pro Ala Pro 
435 440 445 



Thr lie Thr Thr val Ser Glu Asn val Pro Ala Leu Thr Asp His Gly 
450 455 460 



Thr Leu Pro Leu Arg Ser Ser lie Ser Gly val Gin Arg Val Thr lie 
465 470 475 480 



Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr Val His Lys Ala Leu Gly 
485 490 495 



lie Val Ala Pro Lys Val Leu Ser Ser Arg Thr Phe 
500 505 



<210> 14 
<211> 917 
<212> PRT 

<213> Simian adenovirus 
<400> 14 

Met Ala Thr Pro Ser Met Met Pro Gin Trp ser Tyr Met His lie Ala 
15 10 15 



Gly Gin Asp Ala Ser Glu Tyr Leu Ser Pro Gly Leu Val Gin Phe Ala 
20 25 30 



Arg Ala Thr Asp Thr Tyr Phe ser Leu Gly Asn Lys Phe Arg Asn Pro 
35 40 45 



Thr val Ala Pro Thr His Asp val Thr Thr Asp Arg ser Gin Arg Leu 
50 55 60 



Thr Leu Arg Phe val Pro val Asp Arg Glu Asp Thr Ala Tyr ser Tyr 
65 70 75 80 



Lys Val Arg Phe Thr Leu Ala Val Gly Asp Asn Arg Val Leu Asp Met 
85 90 95 



Ala ser Thr Tyr Phe Asp lie Arg Gly val Leu Asp Arg Gly Pro Ser 
100 105 110 



Phe Lys Pro Tyr Ser Gly Thr Ala Tyr Asn Ser Leu Ala Pro Lys Gly 
115 120 125 



Ala Pro Asn Pro Ser Glu Trp Lys Gly ser Asp Asn Lys lie Ser Val 



130 




140 
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Arg Gly Gin Ala Pro Phe Phe ser Thr Ser lie Thr Lys Asp Gly He 
145 150 155 160 

Gin val Ala Thr Asp Thr Ser ser Gly Ala Val Tyr Ala Lys Lys Glu 
165 170 175 

Tyr Gin Pro Glu Pro Gin val Gly Gin Glu Gin Trp Asn Ser Glu Ala 
180 185 190 

ser Asp ser Asp Lys val Ala Gly Arg lie Leu Lys Asp Thr Thr Pro 
195 200 205 

Met Phe Pro Cys Tyr Gly ser Tyr Ala Lys Pro Thr Asn Glu Gin Gly 
210 215 220 

Gly Gin Gly Thr Asn Thr val Asp Leu Gin Phe Phe Ala Ser Ser ser 
225 230 235 240 

Ala Thr ser Thr Pro Lys Ala Val Leu Tyr Ala Glu Asp val Ala lie 
245 250 255 

Glu Ala Pro Asp Thr His Leu Val Tyr Lys Pro Ala Val Thr Thr Thr 
260 265 270 

Thr Thr Ser Ser Gin Asp Leu Leu Thr Gin Gin Ala Ala Pro Asn Arg 
275 280 285 

Pro Asn Tyr He Gly Phe Arg Asp Asn Phe lie Gly Leu Met Tyr Tyr 
290 295 300 

Asn Ser Thr Gly Asn Met Gly Val Leu Ala Gly Gin Ala Ser Gin Leu 
305 310 315 320 

Asn Ala Val Val Asp Leu Gin Asp Arg Asn Thr Glu Leu Ser Tyr Gin 
325 330 335 

Leu Met Leu Asp Ala Leu Gly Asp Arg Ser Arg Tyr Phe Ser Met Trp 
340 345 350 

Asn Gin Ala Val Asp ser Tyr Asp Pro Asp Val Arg lie lie Glu Asn 
355 360 365 

His Gly Val Glu Asp Glu Leu Pro Asn Tyr Cys Phe Pro Leu Gly Gly 
370 375 380 

ser Leu Val Thr Glu Thr Tyr Thr Gly Leu Ser Pro Gin Asn Gly ser 
385 390 395 400 

Asn Thr Trp Thr Thr Asp Ser Thr Thr Tyr Ala Thr Arg Gly val Glu 
405 410 " 415 

lie Gly ser Gly Asn Met Phe Ala Met Glu He Asn Leu Ala Ala Asn 
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420 425 430 

Leu Trp Arg ser Phe Leu Tyr ser Asn val Ala Leu Tyr Leu Pro Asp 
435 440 445 

Glu Tyr Lys Leu Thr Pro Asp Asn He Thr Leu Pro Asp Asn Lys Asn 
450 455 460 

Thr Tyr Asp Tyr Met Asn Gly Arg Val Ala Ala Pro Ser ser Leu Asp 
465 470 475 480 

Thr Tyr val Asn lie Gly Ala Arg Trp ser Pro Asp Pro Met Asp Asn 
485 ~ 490 495 

val Asn Pro Phe Asn His His Arg Asn Ala Gly Leu Arg Tyr Arg ser 
500 505 510 

Met Leu Leu Gly Asn Gly Arg Tyr val Pro Phe His lie Gin Val Pro 
515 520 525 

Gin Lys Phe Phe Ala lie Lys Asn Leu Leu Leu Leu Pro Gly Ser Tyr 
530 535 540 

Thr Tyr Glu Trp Asn Phe Arg Lys Asp val Asn Met lie Leu Gin Ser 
545 550 555 560 

ser Leu Gly Asn Asp Leu Arg val Asp Gly Ala Ser val Arg Phe Asp 
565 570 575 

Ser He Asn Leu Tyr Ala Asn Phe Phe Pro Met Ala His Asn Thr Ala 
580 585 590 

Ser Thr Leu Glu Ala Met Leu Arg Asn Asp Thr Asn Asp Gin Ser Phe 
595 600 605 

Asn Asp Tyr Leu Cys Ala Ala Asn Met Leu Tyr Pro lie Pro Ala Asn 
610 615 620 

Ala Thr ser val Pro lie Ser lie Pro Ser Arg Asn Trp Ala Ala Phe 
62 5 630 635 640 

Arg Gly Trp ser Phe Thr Arg Leu Lys Thr Lys Glu Thr Pro Ser Leu 
645 650 655 

Gly Ser Gly phe Asp Pro Tyr Phe Thr Tyr ser Gly Ser He Pro Tyr 
660 665 ' 670 

Leu Asp Gly Thr Phe Tyr Leu Asn His Thr Phe Lys Lys Val ser lie 
675 680 685 

Met Phe Asp Ser ser val ser Trp Pro Gly Asn Asp Arg Leu Leu Thr 
690 695 700 
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Pro Asn Glu Phe Glu lie Lys Arg Thr val Asp Gly Glu Gly Tyr Asn 
705 710 " 715 720 

Val Ala Gin Cys Asn Met Thr Lys Asp Trp Phe Leu He Gin Met Leu 
725 730 735 

Ser His Tyr Asn lie Gly Tyr Gin Gly Phe Tyr val Pro Glu Gly Tyr 
740 745 ' 750 

Lys Asp Arg Met Tyr Ser Phe Phe Arg Asn Phe Gin Pro Met Ser Arg 
755 760 765 

Gin val val Asp Thr Thr Thr Tyr Thr Asp Tyr Lys Asn Val Thr Leu 
770 775 780 

Pro Phe Gin His Asn Asn ser Gly Phe Val Gly Tyr Met Gly Pro Thr 
785 790 795 800 

Met Arg Glu Gly Gin Ala Tyr Pro Ala Asn Tyr Pro Tyr Pro Leu lie 
805 810 815 

Gly Lys Thr Ala Val Pro Ser Leu Thr Gin Lys Lys Phe Leu cys Asp 
820 825 830 

Arg Thr Met Trp Arg lie Pro Phe Ser ser Asn Phe Met Ser Met Gly 
835 840 845 

Ala Leu Thr Asp Leu Gly Gin Asn Met Leu Tyr Ala Asn Ser Ala His 
850 855 860 

Ala Leu Asp Met Thr phe Glu val Asp Pro Met Asp Glu Pro Thr Leu 
865 870 875 880 

Leu Tyr val Leu Phe Glu val Phe Asp val Val Arg lie His Gin Pro 
885 890 " 895 

His Arg Gly val lie Glu Ala Val Tyr Leu Arg Thr Pro Phe Ser Ala 
900 905 ~ 910 

Gly Asn Ala Thr Thr 
915 

<210> 15 
<211> 607 
<212> PRT 

<213> Simian adenovirus 
<400> 15 

Met Lys Arg Ala Arg Leu Asp Asp Asp Phe Asn Pro val Tyr Pro Tyr 
15 10 15 
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Asp Thr Pro Asn Ala Pro Ser val Pro Phe lie Thr Pro Pro Phe val 
20 25 30 

Ser Ser Asp Gly Leu Gin Glu Lys Pro Pro Gly Met Leu Ser Leu Asn 
35 40 45 

Tyr Gin Asp Pro lie Thr Thr Gin Asn Gly Ala Leu Thr Leu Lys Leu 
50 55 60 

Gly Ser Gly Leu Asn lie Asn Gin Asp Gly Glu Leu Thr Ser Asp Ala 
65 70 75 80 

Ser Val Leu Val Thr Pro Pro lie Thr Lys Ala Asn Asn Thr lie Gly 
85 90 95 

Leu Ala Phe Asn Ala Pro Leu Thr Leu Gin Ser Asp Thr Leu Asn Leu 
100 105 110 

Ala Cys Asn Ala Pro Leu Thr Val Gin Asp Asn Arg Leu Gly lie Thr 
115 120 125 

Tyr Asn ser Pro Leu Thr Leu Gin Asn Ser Glu Leu Ala Leu Ala val 
130 135 140 

» 

Thr Pro Pro Leu Asp Thr Ala Asn Asn Thr Leu Ala Leu Lys Thr Ala 
145 150 155 160 

Arg Pro lie lie Thr Asn Ser Asn Asn Glu Leu Thr Leu Ser Ala Asp 
165 170 175 

Ala Pro Leu Asn Thr ser Thr Gly Thr Leu Arg Leu Gin Ser Ala Ala 
180 185 190 

Pro Leu Gly Leu Val Asp Gin Thr Leu Arg Val Leu Phe Ser Asn Pro 
195 200 205 

Leu Tyr Leu Gin Asn Asn Phe Leu ser Leu Ala lie Glu Arg Pro Leu 
210 215 220 

Ala Leu Thr Thr Thr Gly ser Met Ala Met Gin He ser Gin Pro Leu 
225 230 235 240 

Lys Val Glu Asp Gly ser Leu ser Leu Ser lie Glu Ser Pro Leu Asn 
245 250 255 

Leu Lys Asn Gly Asn Leu Thr Leu Gly Thr Gin Ser Pro Leu Thr Val 
260 265 270 

Thr Gly Asn Asn Leu ser Leu Thr Thr Thr Ala Pro Leu Thr Val Gin 
275 280 285 

Asn Asn Ala Leu Ala Leu Ser val Leu Leu Pro Leu Arg Leu Phe Asn 
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290 295 300 

Asn Thr Ser Leu Gly val Ala Phe Asn Pro Pro lie Ser Ser Ala Asn 
305 310 315 320 

Asn Gly Leu Ser Leu Asp lie Gly Asn Gly Leu Thr Leu Gin Tyr Asn 
325 330 335 

Arg Leu Val Val Asn lie Gly Gly Gly Leu Gin Phe Asn Asn Gly Ala 
340 345 350 

lie Thr Ala Ser lie Asn Ala Ala Leu Pro Leu Gin Tyr ser Asn Asn 
355 360 365 

Gin Leu Ser Leu Asn He Gly Gly Gly Leu Arg Tyr Asn Gly Thr Tyr 
370 375 380 

Lys Asn Leu Ala Val Lys Thr Asp Ser Phe Arg Gly Leu Glu lie Asp 
385 390 395 400 

Ser Asn Gin Phe Leu Val Pro Arg Leu Gly ser Gly Leu Lys Phe Asp 
405 410 415 

Gin Tyr Gly Tyr lie Ser Val lie Pro Pro Thr Val Thr Pro Thr Thr 
420 425 430 

Leu Trp Thr Thr Ala Asp Pro Ser Pro Asn Ala Thr Phe Tyr Asp Ser 
435 440 445 

Leu Asp Ala Lys Val Trp Leu Ala Leu Val Lys Cys Asn Gly Met Val 
450 455 460 

Asn Gly Thr lie Ala lie Lys Ala Leu Lys Gly Thr Leu Leu Gin Pro 
465 470 475 480 

Thr Ala Ser Phe lie Ser Phe Val Met Tyr Phe Tyr Ser Asn Gly Thr 
485 490 495 

Arg Arg Thr Asn Tyr Pro Thr Phe Glu Asn Glu Gly lie Leu Ala ser 
500 505 510 

Ser Ala Thr Trp Gly Tyr Arg Gin Gly Asn Ser Ala Asn Thr Asn Val 
515 520 525 

Thr ser Ala Val Glu Phe Met Pro ser ser Thr Arg Tyr Pro Val Asn 
530 535 540 

Lys Gly Thr Glu Val Gin Asn Met Glu Leu Thr Tyr Thr Phe Leu Gin 
545 550 555 560 

Gly Asp pro Thr Met Ala lie Ser Phe Gin Ala lie Tyr Asn His Ala 
565 570 575 
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Leu 61 u Gly Tyr ser Leu Lys Phe Thr Trp Arg val Arg Asn Arg Glu 
580 585 590 

Arg Phe Asp lie Pro Cys Cys Ser Phe Ser Tyr lie Thr Glu Glu 
595 600 605 

<210> 16 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic oligomer 
<400> 16 

gcgacgggcc gacgctgccc ggct 24 

<210> 17 

<211> 4 

<212> PRT 

<213> Artificial 

<220> 

<223> artifical 

<400> 17 

Arg Arg Ala Ser 

<210> 18 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic oligomer 
<400> 18 

gcggcgcgcc gacgctgccc ggct 24 
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